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(54) TiUe: INHIBITING APOPTOSIS WITH ADENOVIRUS RID PROTEIN 
(57) Abstract 

A method for inhibiting apoptosis of a cell expressing a death receptor of the TNFR family is disclosed. The method involves treating 
the cell with a Receptor Internalization and Degradation (RID) protein complex containing RIDa (10.4K) and RID/? (14JK) proteins 
encoded by the E3 region of adenovirus. The cell can be treated by administering to the cell a polynucleotide expressing the RID complex 
or by administering to the cell a composition containing the RID complex. Compositions containing a RID complex are also disclosed. 
The compositions and method are useful in the treatment of cancer, degenerative and immune disorders, as well as in promoting sur\ ival 
of tissue transplants. An adenovirus vector for delivering the RID complex to cells is also disclosed. 
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Inhibiting Apoptosis with Adenovirus RID Protein 
Reference to Government Grant 

This invention was made with government support under Grant Number ROl 
CAS 853 8. The government has certain rights in this invention. 

5 Related Application 

This application claims priority to U.S. Provisional Application serial number 
60/088,993, filed July 9. 1997, which is incorporated herein in its entirety by reference. 

Background of the Invention 
10 ( I ) Field of the Invention 

This invention relates generally to the regulation of apoptosis and, more particularly, 
to a method for inhibiting apoptosis using the Adenovirus RID protein and to applications of 
this methods including promoting survival of tissue transplants, treating autoimmune disease, 
and promoting tumor destruction in cancer patients. 
1 5 (2) Description of the Related Art 

Apoptosis, or programmed cell death, plays a fundamental role in regulation of the 
immune system. For review, sec White, E. Genes iSc Development /0: 1-15, 1996; van Parijs. 
L. and Abbas, A.K., Curr. Opin. Immunol. S:355-361, 1996: Nagata, S., Cell 55:355-365, 
1997. In recent years researchers have shown that some members of the tumor necrosis factor 
20 (TNF) family of cytokines can induce apoptosis by binding to their specific receptors on 
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target cells. Nagata, 5upra; Baker , S.J. and Reddy, E.P., Oncogene 72:1-9, 1996. The 
receptors for the TNF family of cytokines belong to a family of proteins referred to as the 
TNFR family, which is characterized by an extracellular domam of highly conserved cysteine 
residues contained in cysteine-rich pseudorepeats (Chaudhary et al.. Immunity 7:821-830, 
5 1997). In addition, several members of the TNFR family possess a conserved cytoplasmic 
domain of approximately 80 amino acids called the death domain, which functions to mitiate 
an intracellular apoptotic signaling cascade upon binding of the appropriate cytokine. (See 
Chaudhary et ah, supra; Walczak et al„ EMBO J. 7(5:5386-5397, 1997.) TNFR proteins 
containing death domains comprise a death receptor subfamily which includes: TNFRl 
1 0 (Tartiglia et ah. Cell 74:845-853, 1993); Fas (also called CD95 and Apo- 1 ) (Itoh and Nagata, 
J,BioL Chem, 268: 10932-10937, 1993); death receptor 3 (DR3. also called TRAMP, Apo.3, 
Wsl-l, and LARD) (Chinnaiyan et aL, Science 274:990-992,1996; Kiston et al.. Nature 
354:372-375, 1996); TRAIL-Rl (also known as DR4) (Pan et aL. Science 276:11 l-l 13, 
1997); and TRAIL-RI (also called DR5) (Pan et al.. Science 277:815-818, 1997), The death 
1 5 domains of these proteins are shown in Figure 1 . 

Fas, the most studied death receptor, is expressed on the surface of most cell types, 
including epithelial cells, fibroblasts, T and B cells, liver hepatocytes and some tumor cells 
(Nagata, Nature Medicine 2: 1306-1307, 1996; French et al.. Nature Medicine 5:387-388, 
1997). However, FasL is primarily expressed by activated leukocytes of the immune system, 
20 including cytotoxic T lymphocytes ((TTL's) and natural killer (ts(K) cells (Nagata, Cell, 

supra). It is believed that the Fas ligand (FasL) plays a role in the immune response of these 
cells to induce apoptosis in target cells expressing Fas. Such target cells include vinis- 
infected cells and tumor cells. On the other hand, leukocytes also express Fas, which can 
result in down regulation of the immune response due to activated leukocytes killing each 
25 other (Nagata, Cell, supra). 

Recently, it was discovered that FasL is also expressed in inunune-privileged sites 
such as the eye chamber, parts of the nervous system, and testis and it is believed that any 
activated leukocytes entering such sites are immediately killed through the FasL-Fas 
apoptotic pathway, thereby preventing a potentially crippling immune response (Nagata, Ce//, 
30 supra). This finding could potentially be applied to preventing transplant rejection and, 

indeed, one group has reported that islet allografts were protected from immune rejection by 
CO transplantation with syngeneic myoblasts expressing functional FasL (Lau et al.. Science 
273:109-112, 1996). 

The discovery of FasL expression in immune-privileged sites led a number of groups 
35 to examine whether the means by which tumor cells avoid destruction is through expressi n 
of FasL. A number of tumor cell types were subsequently reported to constitutively express 
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FasL, including lymphoma and leukemia cells (Tanake, et al.. Nature Med, 2:317-322, 1996) 
various nonlymphoid carcinoma cells, including colon cancer (O'Connell, et al., 7. Exp. Med. 
/5¥: 1075-1082, 1996), hepatocellular carcinoma (Strand et al.. Nature Med. 21361-1366, 
1996) and melanoma (Hahne et al.. Science 274: 1363-1366, 1996), As a result of expressing 
5 FasL, many mmor cells have the ability to kill attacking CTL and NK cells thereby reducing 
the immune response against the tumor. In addition, it has been reported that some types of 
tumors become resistant to Fas-mediated apoptosis, either by downregulation of Fas 
expression or by other unknown mechanisms, and thereby avoid being killed by the 
infiltratmg leukocytes (Nagata, Nat, Med,, supra.; Strand et al., supra; Hahne et al., supra), 
1 0 Because alterations in Fas-FasL regulation, including upregulation of FasL expression and 

downregulation of Fas expression, may be involved in tumor cells avoiding destruction by the 
immune system, it would be desirable to devise an approach that would reduce the effect of 
such changes in Fas-FasL regulation. In one such approach it was recently reported that the 
anti-cancer drug doxorubicin enhances expression of both Fas and Fasl in tumor cells (Friesen 
1 5 et al.. Nature Med. 2:574-577, 1996). 

Recent reports have associated other disease states with dysfunction of the Fas 
system, including hypmosinophilic syndromes in humans (Lenardb et al., J. Exp. Med. 
755:721-724, 1996), hepatitis (Kondo et al., Nat. Med. 5:409-413, 1997) and the autoimmune 
disease Hashimoto's thyroiditis (HT) (Giordano et al.. Science 1 75:960-963, 1997). 
20 Consequently, it has been suggested that inappropriate upregulation of Fas may be a causal 
factor in other autoimmune diseases involving tissues which constitutively express FasL 
(French et al., supra). 

Human adenoviruses (used interchangeably herein with Ad), which cause disease in 
the respiratory tract, conjunctiva, intestine, urinary tract and liver, have evolved elaborate 
25 mechanisms to overcome host antiviral defenses, including at least four of the seven known 
proteins encoded by the early region 3 (E3) transcription unit which have been reported to 
inhibit the host inununc response to Ad-infected cells (Fcjer et al., J. Virol. 65:5871-5881, 
1994; Sparer etal.,y. ViroL 770:2431-2439, 1996). Oneof these proteins is a 19kDa 
glycoprotein (gpl9K), which inhibits CTL-mediated lysis of Ad-infected cells in vitro (Efrat 
30 etal., Proc. Natl. Acad Sci, 92:6947-6951, 1995). Three other E3 proteins, the 14.7Kprotein 
and 10.4K protein in combination with the 14.5K protein (referenced hereinafter as the 
10,4K/14.5K complex), protect adenovirus-infected cells against cytolysis and the 
inflammatory response induced by tumor necrosis factor-a (TNF-a) both in vitro and in vivo 
(Sparer et al., jM/yra; Krajcsietal., J. ViroL 70:4904^913, 1 996; Dimitrov et al., J. Virol. 
35 71 :2830-2837, 1997). Although the exact stoichiometry of 10.4K and 14.5K proteins in this 
complex is not known, it is believed to consist of one 14.5IC polypeptide in physical 
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association with a dimer formed by full-length and short forms of the 10.4K polypeptide 
joined in disulfide linkage. Stewart et aU supra. 

Efrat et al. have reported that the expression of the one of the Ad E3 genes, i.e. the 
gene encoding the l9kDa glycoprotein (gpl9K), can prolong survival of pancreatic islet 
5 allografts. The islets were obtained from transgenic animals prepared to contain the entire E3 
genomic DNA from human Ad, however, the gpl9K mRNA was prominently expressed with 
little or no expression of the I0.4K protein which makes up a portion of the 10.4/14.5 
complex. The islet allografts survived reportedly due to the expression of the gpl9K protein 
and there was no suggestion in this reference that the 10.4K or 14.5K proteins cither 
10 separately or in the 10.4K/14.5K complex played any role in the survival of the allografts. 
Nevertheless, the 10.4/14.5 complex can protect Ad-infected cells from the 
inflammatory response in the context of Ad infection (Sparer et aL, supra) and, although it 
has not been heretofore recognized, it is possible that the 10.4K/14.5K complex could also 
provide a novel basis for modulating the immune system in certain disease processes. 

15 

Summary of the Invention 

In accordance with the present invention, the inventor herein has succeeded in 
discovering that the Ad 10.4K/14 JK complex inhibits apoptosis mediated by death receptors, 
in particular Fas or TNFR-l, by removing the death receptor from the cell surface. The 
20 present invention, thus, provides a method for inhibiting apoptosis of a cell comprising 

treating the cell with an effective amount of a 10.4K/14.5K complex referenced herein as RID 
(Receptor Internalization and Death) or as RID complex. The RID complex reduces the 
number of molecules of one or more death receptors on the surface of the cell. This down- 
regulation of the death receptor results from internalization of the receptor to endosomes and 
25 degradation of the internalized death receptor by lysozymes. The RID complex is obtained 

from or derived from the RIDa and RID(3 proteins encoded by the Ad E3 region DNA. Other 
E3 region-encoded proteins, including the gpl9iC and 14.7K proteins, are not required to 
remove the death receptor from the cell surface or to induce apoptosis. Due to the similar 
structure of TNFR death receptors, and in the common pathway by which they mediate 
30 apoptosis, it is believed that RID can inhibit apoptosis mediated by all death receptor 
members of the TNFR family by promoting their removal from the cell surface. 

In one embodiment of the present invention, the cell is treated with RID by 
administenng to the cell a polynucleotide encoding the RID complex, through which the RID 
complex is expressed in the cell. Alternatively, the treating step comprises administering the 
35 RID complex to the cell, preferably in a earner that facilitates delivery of the complex into the 
cell. The method can be used to inhibit apoptosis of cells expressing one or more death 
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receptors of the TNFR family, including but not limited to Fas, TNFR-l, DR3, TRAIL-Rl 
and TRAIL-R2. Where the cell comprises a tissue, the method is useful for promoting 
survival of a tissue transplant in a patient or in promoting survival of a tissue under attack in a 
patient suffering from a degenerative disease, an immunodeficiency disease, an autoimmune 
disorder or other diseases associated with disregulation of apoptosis mediated by the TNFR 
death receptors. The method is also useful in inhibiting apoptosis of leukocytes mediated by 
tumor cells in cancer patients, thereby promoting leukocyte destruction of the patient's tumor 
cells. 

Accordingly, in another embodiment, the present invention provides a method for 
decreasing apoptosis of target cells in a patient comprising treating the patient with an 
effective amount of a RID complex. The target cells express a death receptor which is 
downregulated when RID enters the cells. 

In yet another embodiment, the invention provides a method for inhibiting leukocyte 
apoptosis in a patient comprising withdrawing leukocytes from the patient, treating the 
15 leukocytes with an effective amount of a RID complex, and administering the treated 
leukocytes to the patient 

In another embodiment, the present invention provides a composition comprising a 
RID complex in a carrier suitable for facilitating cntiy of the RID complex into a cell. As 
illustrated in Figure 3, a RID complex comprises at least three polypeptides: a full-length Ad 
20 E3 10.4K protein having two transmembrane domains (RIDa-L), a short form of the 1 0.4K 
protein with only one transmembrane domain (RIDa-S), and a 14.5K protein (RIDp). RID 
compositions intended for treating humans preferably contain a pharmaceutically acceptable 
carrier. In one embodiment, the carrier component of the composition comprises a liposome. 
The present invention also provides an Ad vector for expressing a RID complex in a 
25 cell and to cells transfccted with this vector. The vector comprises a nucleotide sequence 
encoding the RIDa and RID(i polypeptide components of the complex operably linked to a 
promoter capable of directing expression of the nucleotide sequence in the cell. A preferred 
vector consist of 231-10 (SEQ ID NO:2), which expresses functional polypeptides for all of 
the E3 genes other than adp, 
^0 Among the several advantages found to be achieved by the present invention, 

therefore, may be noted the provision of compositions and methods for inhibiting apoptosis of 
a cell expressing a death receptor, the provision of compositions and methods for promoting 
tissue transplant survival in patients; the provision of compositions and methods for treating 
patients suffering from an autoimmune disease and other disorders associated with 
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dysfunction of apoptosis regulation; and the provision of compositions and methods for 
promoting tumor destruction in cancer patients. 

Brief Description of the Drawings 
S . Figure 1 shows an alignment of the amino acid sequences of the death domains of the 

death receptor subfamily of TNFR proteins, with residues identical in more than 30% of 
sequences shaded black and residues conserved in more than 30% of sequences shaded in 
gray; 

Figure 2 is a schematic representation of apoptosis mediated by death domain- 
10 containing members of the TNF receptor superfamily, with the death receptors Fas, TNFR I, 
TRAIL-Rl, TRAIL-R2 and DR3 depicted by the bars on the extreme right and left sides of 
the figure, the ligands for these receptors indicated in parenthesis, and showing the 
association of the death receptors with intracellular proteins in the apoptotic singling cascade 
at the bottom of the figure; 
1 5 Figure 3 is a schematic representation of a preferred RID complex showing one 

mature 14.5K polypeptide having an O-glycosylated residue in the extracellular (or lumenal) 
domain and an 0*phosphorylated residue in the cytoplasmic domain, and two covalently- 
linked 10.4K polypeptides, one of which is an uncleaved, fixIMength form of 10.4K (10.4K- 
L) having two membrane-spanning regions (diagonal stripes) and the other a cleaved, short 
20 form of 10.4K (10.4K-S) with only one transmembrane region; 

Figure 4 illustrates the amino acid sequences and various domains of preferred 
embodiments of the RIDa and RIDP polypeptides, showing in Fig. 4A-4B the long and short 
forms of the E3 10.4K polypeptides (RIDa-L and RIDa-S) from Ad serotype 2, Fig. 4C the 
pre-l4.5K (RIDp) polypeptide of Ad serotype 5, and in Fig. 4D the mature 14.5K (RIDP) 
25 polypepdde of Ad serotype S, with the signal sequences and transmembrane domains 
underlined and the asterisks indicating sites for disulfide linkage in RIDa or for 
O-phosphorylaiion in RID(3; 

Figwe S is a schematic representation of a model for RID-induced internalization and 
degradation of Fas and TNFRl death receptors, showing RID and the death receptor in the 
30 plasma membrane, entry of RID and the death receptor into endosomes, transport of these 
endosomes to lysosomes where the death receptor is degraded, and recycling of RID in 
endosomes to the cell surface, where it can internalize another death receptor molecule; 

Figure 6 shows photographs of MCF7-Fas cells (Figs. 6A and 6B) infected with 
recTOO Ad ("wild-type**) or (Figs. 6C and 6D) transiently transfected with pMT2-RIDa plus 
35 pMT2-Rip which were then treated with an agonist monoclonal antibody to Fas and double- 
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Stained for the adenovirus-encoded DNA binding protein (anti-ADP) (Fig. 6A) and for DNA 
4, 6-diamidmo2-phcnylindolc (DAPI) (Fig. 6B) or doublc-siaincd for RIDP (Fig. 6C) and 
DNA (Fig. 6D), with the photographs taken using a lOOX Plan apo objective lens; 

Figure 7 shows flow cytometry tracings of MCF7-Fas cells which were mock- 
5 infected (Fig. 7A) or infected with wild-type Ad (Ad5 and reclOO) (Figs. 7B-7C) or with the 
indicated Ad E3 mutant (Figs. 7D-7H) and then incubated with antibodies to Fas (bold trace), 
transferrin receptor (dashed trace), or control IgG (light trace); 

Figure 8 shows flow cytometry tracings of A549 cells which were mock-infected 
(Fig. SB) or infected with wild-type Ad (rec700) (Fig. 8C) or with the indicated Ad E3 
1 0 mutant (Figs. 7D-7H) and then incubated with antibodies to Fas (red trace), transferrin 

receptor (blue trace), or control IgG (black trace), with the cell pattern for mock-infected cells 
shown in Fig. 8 A and Rl indicating the cells that were gated for the analysis; 

Figure 9 shows photographs of mock-infected MCF7 cells (Fig. 9A) or MCF7.Fas 
cells mock-infected (Fig. 9B) or infected with the indicated viruses (Figs. 9C-9H) and then 
1 5 analyzed for Fas by immunofluorescence, with the speckled pattern in Figs. 9C, 9G, and 9H 
representing putative endosomes and lysosomes containing Fas; 

Figure 10 shows an immunoblot of proteins extracted froni MCF-7 Fas cells 
following mock-infection or infection with the indicated wild-type and mutant Ads and 
stained for Fas (Fig. lOA), transfennn receptor (Fig. lOB) or Ad El A (Fig. IOC), with 
20 molecular weight markers indicated on the right; 

Figure 1 1 shows photographs of COS7 cells transfectied with expression plasmids for 
Fas and RIDa (Fig. 1 1 A, 1 IB), Fas and RIDP (Fig. 1 IC, 1 ID), or Fas, RIDa, and RIDP (Fig. 
1 IE- 1 1 H) and double-stained for RIDa and Fas (Fig. 1 1 A, 1 IB, 1 IE, 1 IF) or for RIDp and 
Fas (Fig. 1 IC, 1 ID, 1 IG, I IH) with arrow in Figs. I IG and H indicate vesicles that appear to 
25 contain both RIDP and Fas; 

Figure 12 shows photographs of rec700-infected AS49 cells double-stained for Fas 
and a lysosomal protein, LAMP I and examined by confocal microscopy, with Fig. 12A 
showing cells labeled with rabbit anti-Fas antibody and fluorescein isothiocyanate (FITC), 
Fig. 12B showing cells labeled with mouse anti-LAMP-1 antibody and rhodamine 
30 isothiocyanate (RITC). Fig. 12C showing the combined images of Fig. 12A and 12 B, and 

Fig. 12D showing a perpendicular view of the image in Fig. 12C (arrows), 1 \im thick, where 
green indicates Fas, red indicates LAMP- 1 and yellow indicates colocalization of Fas and 
LAMPl and the bar indicating a distance of 10 ^m; 
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Figures 13A-13C show photographs of immunofluorescence labeling of Fas in 
rec700-infectcd cells treated (Fig. 13A) or not treated (Fig. 13B) with bafilomycin Al (Baf). 
or in d/309 (RID>infected cells treated with Baf (Fig. I3C); 

Figure 1 3D shows an immunoblot of proteins extracted from mock-. reclOO- or 
5 <i/309-infected cells treated (+) or not treated (-) with bafilomycin Al (BaO and stained for 
Fas, ERp72, or Ad protein E1B.19K; 

Figure 13E shows the immunoblot of Fig. I3D following removal of antibody and 
restaining for transferrin receptor (TfR); 

Figure 14 shows an immunoblot of proteins extracted from COS7 cells transfected 
1 0 with various combinations of plasmids expressing Fas, Shp- 1 , RIDa or RIPp as indicated by 
the and signs and stained for Fas, Erp72 or Shp-1 using appropriate antisera, with the 
arrows indicating two groupings of bands which correspond to differently glycosylated 
species of Fas; 

Figure 15 shows an immunoblot of proteins extracted from COS7 cells transfected 
1 5 with various combinations of plasmids expressing Fas, chloramphenical acetyl-transferase 
(CAT), RIDa or RID(J as indicated by the and signs and stained for Fas, Erp72 or 
CAT using appropriate antisera, with the arrows indicating two groupings of bands which 
correspond to diffcrendy glycosylated species of Fas; 

Figures 16A and 16B are graphs of the amount of lysis of mock-, rec700- or rf/7001- 
20 infected Fas-positive mouse P815 ceils by activated cytotoxic lymphocyetes (CTL) from 
peforin (-/-) mice (Fig. 16A) or niatched perforin (+/+) mice (Fig. 16B) at effector 
lymphocyte:target ratios of 60: 1 (black bars), 20: 1 (stippled bars), or 6: 1 (open bars); 

Figure 16C shows flow cytometry tracings of P815 cells infected with rec700 (middle 
plot) or rfnOOO (right dark plot) and then stained for Fas, with the left plot showing the IgG 
25 control; 

Figure 17 is a graph of the amount of lysis of mock- or Ad-infected Fas-positive 
human A549 cells by natural killer (NK) cells at NK:A549 cell ratios of 10:1 (black bar) and 
5:1 (striped bar); 

Figure 18 shows flow cytometry tracings of human HeLa cells mock-infected (green 
30 trace) or infected with rec700 (red trace) or d/7 12, a mutant that overexpresses RID and E3- 
14.7K (blue trace) and then stained for TNFRl (Fig. 18A) or Fas (Fig. 18B), with the 
percentage of cells that stained positive for TNFRl or Fas indicated at the bottom; 

Figure 19 shows flow cytometry tracings of human HeLa cells mock-infected (black 
trace) or infected with rec700 (red trace), dllSZ (light blue trace), d/764 (dark blue trace), 
35 dni2 (green trace), dn09 (pink trace) and then stained for TNFRl (Fig. 19 A) or Fas (Fig. 
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19B). with the genotype of each virus and the percentage of cells that stained positive for 
TNFRl or Fas indicated at the bottom; 

Figure 20 shows flow cytometry tracings of human HeLa cells mock-infected (black 
trace) or infected with the 23 1-IO vector, which expresses only the E3 proteins, and then 
stamed for TNFRl at 24 hr. p.i.. (red trace) or 48 hr. p.i. (blue trace); 

Figure 21 shows an immunoblot of TNFRl extracted from A549 cells mock-infecied 
or infected with rec700 in which cell surface proteins were labeled by incubation with biotin 
at the indicated hour p.i.; 

Figure 22 shows an immunoblot of TNFRl (Fig. 22A) and RIDp (Fig. 22B) extracted 
from A549 cells mock-infected or infected with rec700 or the 23 1 -10 vector in which cell 
surface proteins were labeled by incubation with biotin at the indicated hour p.i.; 

Figure 23 A shows an immunoblot of TNFRl extracted from A549 cells mock- 
infected or infected with the indicated vims in which cell surface proteins were labeled by 
incubation with biotin at 26 h p.i.; 

15 Figure 23B shows an immunoblot of Ad EIB-19K protein extracted from the same 

cells used in Fig. 23A; 

Figure 24 shows a photograph of exposed skin and muscle of the hind flanks of a 
female C57B1/6 mouse sacrificed 18 days after the flanks were subcutaneously injected with 
human cancer A549 cells infected with the 231-10 vector, with A549 tumors appearing as 
20 whitish-tan masses on each flank; 

Figure 25 shows a closer view of the tumor on the right flank of the mouse in Fig. 24: 
Figure 26 shows an immunoblot of proteins extracted from an A549 tumor grown in a 
mouse such as described in Fig. 24; 

Figure 27 is a schematic illustration of the stnicnire of the genome of the Ad 231-10 
25 vector, with the black horizontal bar representing the backbone of the Ad5 genome, from 
which the B t and E3 regions are deleted, as indicated by the triangles below the black bar. 
and containing an expression cassette with the CMV promoter controlling the E3 genes 
inserted into the deleted El region, as indicated by the triangle to the left, above the black bar, 
with the transcription unit oriented from right to left as indicated by the arrowhead and 
30 restriction endonuclcase cleavage sites flanking the CMV-E3 cassette indicated; 

Figure 28 illustrates the nucleotide sequence of the 23 1 -10 genome with the 
numbering beginning with the first base-pair on the conventional left side of the Ad5 genome 
as shown in Fig. 27 and proceeding to the last base-pair at the right side of the genome; 

Figure 29 shows an immunoblot of E3 RIDp. 14.7K. and gpl9K proteins expressed in 
35 A549 cells infected with the 23 1 - 10 vector and detected at the days p.i. indicated, with lane A 
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containing proteins extracted from 23 l-lO-infccted cells at 1 day p.i. following treatment with 
l-p-D-arabinofuransylcytosine (araC) at 2 h p.i.; and 

Figure 30 shows a photograph of A549 cells infected with the 231-10 vector and 
gpl9K, RIDP, and 14.7K proteins detected by indirect immunoflourescence. 

5 

Detailed Description of the Invention 

The present invention is based on the discovery that the Ad RID complex inhibits 
apoptosis mediated by death receptors, and in particular by Fas and TNFRl. Some of the 
molecular events involved in apoptosis induced through death receptors of the TNFR family 
10 are illustrated in Fig. 2. Fas (bar on the extreme right) is localized on the cell surface. When 
FasL engages Fas on the outside of the cell (top of Fig. 2), Fas associates with proteins within 
the cell (bottom of Fig. 2). First, Fas binds a protein named FADD through their 
corresponding death domains and then the Fas/FADD complex binds the protein named 
Caspase 8 through another region in FADD and Caspase 8 named the "death effector" 
IS domain. This binding activates the enzymatic activity of Caspase 8, an "initiator" caspase. 
Activated Caspase 8 cleaves other caspases (effector caspases), which then cleave other 
proteins, and apoptosis ensues. Apoptosis induced through TNFR is very similar, except that 
an additional protein, named TRADD, is involved. TNF engages TNFRl, causing it to bind 
TRADD through death domains in TNFRl and TRADD (left part of Fig. 2). The 
20 TNFRl/TRADD complex then binds FADD through their death domains and this is followed 
by binding to Caspase 8, etc. TRAIL-Rl, TRAIL-R2, and DR3 are believed to undergo a 
similar binding cascade to activate caspases, although the ligand that triggers apoptosis 
through DR3 is unknown. 

RID inhibits apoptosis by means of an internalization and degradation mechanism 
25 common to all death receptors. As illustrated in Figure 2, RID shuttles the death receptor 

from the cell surface to lysosomes where the receptors are degraded. This model is supported 
in part by the fact that the RID complex has two motifs in its intracellular portion that are 
known to play a role in the internalization of some cell surface receptors and their transport to 
lysosomes. These motifs are a dileucine motif (LL), which is present in RIDa, and a 
30 tyrosine-based motif in RIDp, which is YXX^). where Y is tyrosine, X is any amino acid, and 
<t) is an aromatic or bulky hydrophobic amino acid such as phenylalanine, tyrosine, tryptophan 
and proline. It is believed that RID acts through the LL and YXK^ motifs to cause Fas or 
TNFRl to be internalized into early/sorting endosomes. Again, acting through the LL and 
YXX<^ motifs, RID mediates transport of the early endosomes t late endosomes and then to 
35 lysosomes where the receptors are degradied. RID then recycles back to the cell surface in 
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endosomcs where ii repeats this process. Additional evidence supporting this model is as 
follows: ( 1 ) RID co-localizes with Fas on the cell surface as well as in vesicles; (2) 
degradation of Fas is inhibited by bafilomycin Al, an inhibitor of late endosome function; (3) 
the RID proteins are very stable, as indicated by pulse-chase experiments, whereas Fas is very 
unstable in the presence of RID; and (4) mutation of the LL motif severely reduces the 
function of RID, and conversion of the Y in the YXX<t» motif abolishes the function of RID. 

Because of their similar structures and common apoptotic pathway, it is believed that 
all death receptors of the TNFR family can be removed from the cell surface by RID via 
internalization into endosomes and subsequent degradation in lysozymes. Thus. RID will 
inhibit apoptosis mediated by any member of the TNFR death receptor family. As such, RID 
should be useful to promote survival of cells and tissues in the treatment of diseases such as 
degenerative diseases, immune disorders including autoimmune disorders, ischefnic injury 
such as caused by myocardial infarcdon, stroke induced neuron death and rcpcrfiision injury, 
alcohol-induced hepatitis, diseases caused by viral infection, such as AIDS and fulminant 
hepatitis, and cancer. RID is also useful in promoting survival of tissue transplants in 
transplant recipients. 

Thus, in one embodiment the invention provides a method for inhibiting apoptosis of 
a cell comprising treating the cell with an effective amount of a Receptor Internalization and 
Degradation (RID) complex. CcUs which can be treated by this method express one or more 
death receptors of the TNFR family, which includes Fas, TNFRl, DR3, TRAIL-RI, TRAIL- 
R2 and any subsequently discovered family member characterized by the presence of a death 
domain. Cells expressing a death receptor can be identified by methods known in the art, 
such as incubating the cells with one or more death receptor ligands followed by evaluating 
the cells for apoptosis, detecting death receptor molecules on the cell surface with an antibody 
against the death receptor, or detecting mRNA molecules that encode the death receptor. Cell 
death by apoptosis is readily recognizable and includes cytoplasmic and nuclear condensation, 
loss of membrane integrity and extensive fragmentation of chromosomal DNA, which forms a 
characteristic ladder when analyzed by gel electrophoresis. Vaux, D., Proc. NatLAcad. Sci 
90:7S6'7E9, 1993. Antibodies against the TNFR death receptors are either commercially 
available or can be readily prepared using standard techniques. 

The RID complex used in the method comprises at least one of each of the following 
polypeptides: a RIDa-L polypeptide, a RIDa-S polypeptide, and a RIDp polypeptide. RIDa 
and RIDp are synonymous with the 10.4K and 14.5K proteins, respectively, which are 
encoded by two genes in the Ad E3 region. The basic structures of diese polypeptides in a 
membrane are illustrated in Fig. 3. RIDa-L comprises a first transmembrane domain, which 
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IS an uncleaved signal sequence, an extracellular domain, an incemal transmembrane domain, 
and a cytoplasmic domain. RIDa-S lacks the signal sequence and thus comprises the 
extracellular domain, the internal transmembrane domain and the cytoplasmic domain. RID$3 
comprises an extracellular domain, which preferably lacks the signal sequence as shown in 
5 Fig. 4D, a transmembrane domain and a cytoplasmic domain. When the RID complex is 
localized in membrane structures and vesicles within the celU the extracellular domain is 
located in the lumen of these membranes and vesicles. 

In preferred embodiments, the RIDa-S and RIDa-L polypeptides are covalemly 
joined by a disulfide bond between cysteine residues in their extracellular domains which 
1 0 correspond by alignment with the Cysji residue of the Ad2 10.4K protein (Fig. 4A). Also, 
RIDP preferably has a mucin type O-Hnked oligosaccharide attached to one or more amino 
acids in the extracellular domain and/or is phosphorylated at one or two serines in the 
cytoplasmic domain. (See Krajcsi et al., Kiro/. 757:492-498, 1992; Krajcsi et al.. ViroL^ 
755:570-579, 1992.) The location of these residues in RDDp polypeptides encoded by E3 
1 5 genes of different Ad serotypes can be determined by alignment with the amino acid sequence 
for the 14.5K protein of Ad5, which is shown in Fig. 4C. 

A RID complex made by Ad in vivo is believed to contain RIDa-L, RID-aS and 
RIDP (lacking the signal sequence) polypeptides in about a 1:1:1 ratio. However, it is 
possible that various ratios of these polypeptides will be fiinctional or that in some cases 
20 different ratios will be required to provide a functional complex. 

The amino acid sequences of the RIDa-L, RIDa-P and RIDp polypeptides 
comprising the RID complex may be identical to those of naturally-occurring Ad RIDa 
(10.4K) and RIDP (14.5K) protems from any Ad serotype or may comprise functional 
variants of such naturally-occurring sequences. As stated above, the genes encoding the 
25 RIDa and RIDp proteins are highly conserved among Ad serotypes. These genes are also 
conserved in Ads from some non-human species. Thus, it is believed that their encoded 
products should function very similar to the RIDa and RIDP polypeptides from Ad2 and Ad5, 
which were ii'sed in the expenments described herein. In addition, the invention includes the 
use of RID complexes m which the RIDa-L, RIDa-S, and RIDp polypeptides comprise 
30 homologous amino acid sequences, i.e., encoded by the same Ad serotype, or that comprises 
heterologous sequences, i.e.. encoded by two or more Ad serotypes. Thus, for example, a 
RID complex may comprise (I) a RIDa-L polypeptide comprising the EUDa-L amino acid 
sequence from Ad2, (2) a RIDa-S polypeptide comprising the RIDa-S amino acid sequence 
from AdS, and (3) a RIDp polypeptide comprising the RIDP amino acid sequence from Ad9. 
35 Preferably, the RID complex comprises polypeptides whose amino dcid sequences correspond 
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to serotypes from the same subgroup. More preferably, the RID complex comprises EUDa-S 
and RIDo-L polypeptides encoded by the RIDa gene of Ad2 and a RIDp polypeptide 
cr.coded by the RIDp gene of Ad5. 

A functional variant of a nanirally-occurring RIDa or RIDp sequence contains one or 
more amino acid substitutions in that sequence which do not destroy the ability of the 
resulting polypeptide to function in a RID complex to inhibit apoptosis. Preferably, amino 
acid substitutions in functional variants are conservative amino acid substitutions, which refer 
to the interchangeabtlity of residues having similar side chains. Conservatively substimted 
amino acids can be grouped according to the chemical properties of their side chains. For 
example, one grouping of amino acids includes those amino acids have neutral and 
hydrophobic side chains (A, V. L. I. P, W. F. and M): another grouping is those amino acids 
having neutral and polar side chains (G. S, T. Y, C, N. and Q); another grouping is those 
amino acids having basic side chains (K, R. and H); another grouping is those amino acids 
having acidic side chains (D and E); another grouping is those amino acids having aliphatic 
1 5 side chains (G. A, V, L. and I); another grouping is those amino acids having ahphatic- 
hydroxyl side chains (S and T); another grouping is those amino acids having amine- 
containing side chains (N, Q, K, R, and H); another grouping is those amino acids having 
aromatic side chains (F, Y, and W); and another grouping is those amino acids having sulfur- 
containing side chains (C and M). Preferred conservative amino acid substitutions groups are: 
R-K; E-D, Y-F, L-M; V-I. and Q-H. In addition, conservative amino acid substimtions as 
used herein is intended to include substimtions which are present at corresponding positions 
in sequences from different Ad serotypes. 

A functional variant as used herein can also include modified sequences in which one 
or more amino acids have been inserted, deleted, or replaced with a different amino acid or a 
modified ainino acid or unusual amino acid, as well as modifications such as glycosylation or 
phosphorylation so long as the polypeptide containing the modified sequence retains the 
biological activity of a RIDa or RIDP polypeptide. By retaining the biological activity, it is 
meant that the modified polypeptide can function to form a RID complex with anti-apoptotic 
activity. 

In one embodiment, the cell is treated with the RID complex by administering to the 
cell a polynucleotide encoding the RID complex. The polynucleotide comprises a nucleotide 
sequence encoding a RIDa polypeptide and a RIDp polypeptide operably linked to a 
promoter that produces expression of the RID complex in the cell. In one variation of this 
embodiment, the polynucleotide can contoin portions of the Ad E3 region in addition to that 
portion encoding RIDa and RIDp. However, the polynucleotide predominantly expresses the 
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RIDa and RIDP proteins over any other Ad proteins. Alternatively, actions on cell apoptosis 
resulting from expression of the polynucleotide are predominantly due to the RID complex 
rather than any other protein expressed by the polynucleotide. The polynucleotide can 
comprise an expression plasmid, a retrovirus vector, an Ad vector, an adenovirus associated 
5 vector (AAV) or other vector used in the art to deliver genes into cells. Altematively, the 
polynucleotide can be administered to the cell by microinjection. 

In embodiments where the cell being treated is in a patient, such as cells comprising a 
tissue transplant or a tissue involved in an autoimmune disorder, the polynucleotide encoding 
RID is administered to the patient. Any of the vectors discussed above can be used. It is also 
1 0 contemplated that the RID complex be administered by coinfection with a replication- 
defective Ad expressing RID and another replication competent Ad that complements the 
replication defective virus to increase the expression of RID in the infected cells. 

Preferably, the polynucleotide is selectively delivered to target cells within the patient 
so as not to affect apoptosis in other tissues. Targeted delivery of the polynucleotide can be 
1 5 done for example by using delivery vehicles such as polycations« liposomes or viral vectors 
containing targeting moieties that recognizes and binds a specific marker on the target cell. 
Such methods are known in the art, see, e.g., U.S. Patent No. 5,635,383. Another targeted 
delivery approach uses viral vectors that can only replicate in specific cell types which is 
accomplished by placing the viral genes necessary for replication under the transcriptional 
20 control of a response element for a transcription factor that is only active in the target cell. 
See, e.g., U.S. Patent No. 5,698,443. 

In other embodiments of the invention, the cell is treated by administering to the cell 
a composition comprising a RID complex. The RID complex for use in such embodiments 
can be prepared by a variety of means. For example, the RID complex can be isolated from 
25 the membranes of Ad-infected cells or cells transfected with a nucleotide sequence encoding 
the RIDa and RID|3 polypeptides. Altematively, the polypeptide components of the complex 
can be expressed in separate cell cultures, extracted into an appropriate buffer and mixed in 
vitro, RIDa and RID{3 polypeptides can also be chemically synthesized and mixed to form 
the complex. The RID complex can then be tested for the ability to inhibit apoptosis of a cell 
30 expressing a death receptor as described herein for Fsis and TNFRl . 

Preferably, the RID complex is administered with a carrier that facilitates delivery of 
the RID complex into the cell, such as liposomes. Where the RID complex is being 
administered to a patient, the liposomes can have targeting moieties exposed on the surface 
such as antibodies, ligands or receptors to specific cell surface molecules to limit delivery of 
35 RID to targeted cells. Liposome dnig delivery is known in the art (see, e.g., Amsclem et al.. 
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Chem. Phys. Lipid 64:219-237, 1993). Altemaiively. one or more of the polypeptides of the 
complex can be modified to include a specific transit peptide that is capable of delivering the 
peptide into the cytoplasm of a cell or the complex can be delivered directly into a cell by 
microinjection. 

Compositions comprising a RID complex can be administered by any suitable route 
known in the art including, for example, intravenous, subcutaneous, intramuscular, 
transdermal, intrathecal or intracerebral or administration to cells in ex vivo treatment 
protocols. Administration can be cither rapid as by injection or over a period of time as by 
slow infusion or administration of slow release formulation. For treating tissues in the central 
nervous system, administration can be by injection or infusion into the cerebrospinal fluid 
(CSF). When it is intended that the RID complex be administered to ceils in the central 
nervous system, administration can be with one or more agents capable of promoting 
penetration of the protein complex across the blood-brain barrier. 

The RID complex can also be linked or conjugated with agents that provide desirable 
1 5 pharmaceutical or pharmacodynamic properties, including for example, substances known in 
the art to promote penetration or transport across the blood-brain barrier such as an antibody 
to the transferrin receptor (Friden et al.. Science 259:373-377, 1993), a polymer such as 
polyethylene glycol to obtain desirable properties of solubility, stability, half-life and other 
pharmaceutically advantageous properties Davis et al. Enzyme Eng 4:169-73, 1978; Bumham, 
20 Am J Hosp Pharm 51 a\Q-2\Z, 1994). 

For nonparental administration, the compositions can also include absorption 
enhancers which increase the pore size of the mucosal membrane. Such absorption enhancers 
include sodium deoxycholate, sodium glycocholate, dimethyl-JJ-cyclodextrin, lauroyl-l- 
lysophosphatidylcholine and other substances having structural similarities to the 
25 phospholipid domains of the mucosal membrane. 

The compositions are usually employed in the form of pharmaceutical preparations. 
Such preparations are made in a manner well known in the pharmaceutical art. One preferred 
preparation utilizes a vehicle of physiological saline solution, but it is contemplated that other 
pharmaceutically acceptable carriers such as physiological concentrations of other non-toxic 
30 salts, five percent aqueous glucose solution, sterile water or the like may also be used. It may 
also be desirable that a suitable buffer be present in the composition. Such solutions can, if 
desired, be lyophilized and stored in a sterile ampoule ready for reconstitution by the addition 
of sterile water for ready injection. The primary solvent can be aqueous or alternatively non- 
aqueous. 

carrier can also contain other pharmaceutical ly-acceptable excipients for 
modifying or maintaining the pH, osmolarity, viscosity, clarity, color, sterility, stability, rate 
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of dissolution, or odor of the formulation. Similarly, the carrier may contain still other 
pharmaceutically-acceptablc excipients for modifying or maintaining release or absorption or 
penetration across the blood-brain barrier. Such excipients are those substances usually and 
customarily employed to formulate dosages for parenteral administration in either unit dosage 
5 or multi-dose form or for direct infusion by continuous or periodic infusion. 

It is also comemplaied that certain formulations comprismg the RID complex are to 
be administered orally. Such formulations are preferably encapsulated and formulated with 
suitable carriers in solid dosage forms. Some examples of suitable carriers, excipients, and 
diluents include lactose, dextrose, sucrose, sorbitol, mannitol, starches, gum acacia, calcium 
10 phosphate, alginates, calcium silicate, microcrystallme cellulose, polyvinylpyrroUdone, 
cellulose, gelatin, syrup, methyl cellulose, methyl- and propylhydroxybenzoates, talc, 
magnesium, stearate, water, mineral oil, and the like. The formulations can additionally 
include lubricating agents, wetting agents* emulsifying and suspending agents, preserving 
agents, sweetening agents or flavoring agents. The compositions may be formulated so as to 
1 5 provide rapid, sustained, or delayed release of the active ingredients after administration to 

the patient by employing procedures well known in the art. The formulations can also contain 
substances that diminish proteolytic degradation and/or substances which promote absorption 
such as, for example, surface active agents. 

The RID complex is administered to patients in an amount effective to inhibit 
20 apoptosis of target cells within the patient. The specific dose is calculated according to the 
approximate body weight or body surface area of the patient or the volume of body space to 
be occupied. The dose will also be calculated dependent upon the particular route of 
administration selected. Further refmement of the calculations necessary to determine the 
appropriate dosage for treatment is routinely made by those of ordinary skill in the art. Such 
25 calculations can be made without undue experimentation by one skilled in the an in light of 
the activity'disclosed herein in cell death assays. Exact dosages are determined in 
conjunction with standard dose-response studies. It will be understood that the amount of the 
composition actually administered will be detemiined by a practitioner, in the light of the 
relevant circumstances including the condition or conditions to be treated, the choice of 
30 composition to be administered, the age, weight, and response of the individual patient, the 
severity of the patient's symptoms, and the chosen route of administration. Dose 
administration can be repeated depending upon the pharmacokinetic parameters of the dosage 
formulation and the route of administration used. 

The compositions and methods of the invention are contemplated for use in 
35 promoting survival of tissue transplants. For example, the tissue can be treated in vitro with 
the RID complex and the treated tissue then introduced into the transplant. In addition. 
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previously transplanted tissues can be treated with RID by administering the RID complex to 
the transplant recipient. In either scenario, it is contemplated that the RID complex can be 
administered as a protein formulation or as a polynucleotide expressing the complex. 
In another embodiment, the RID complex is used to promote the survival of 

5 leukocytes in cancer patients. The leukocytes can be treated in vivo by administering to the 
patient a polynucleotide expressing RID or a composition containing the RID complex. 
Preferably, the polynucleotide or RID complex is targeted to the leukocytes by one of the 
targeting methods discussed above. For example, cytotoxic T cells could be targeted by using 
an antibody against the CDS marker and natural killer cells targeted by use of an antibody 

10 agamst the CD 1 6 marker. Alternatively, the leukocytes can be removed from the patient, 

treated with the RID complex ex vivo, and the treated leukocytes then returned to the patient. 

Preferred embodiments of the invention are described in the following examples. 
Other embodiments within the scope of the claims herein will be apparent to one skilled in the 
art from consideration of the specification or practice of the invention as disclosed herein. It 

15 is intended that the specification, together with the examples, be considered exemplary only, 
with the scope and spirit of the invention being indicated by the claims which follow the 
examples. 

Example I 

This example illustrates inhibition of Fas-mediated apoptosis by adenovirus EIB and 
E3 proteins. 

Human breast adenocarcinoma cells expressing Fas (MCF7-Fas) (Jaattela et al.. 
Oncogene 70:2297-2305, 1995) were infected with rcc700 or with an adenovirus mutant 
lacking expression of one or more of the RIDa, RIDp, E3-14,7K and EIB-19K proteins. 
rec700 is an Ad5-Ad2-Ad5 "wild-type" recombinant whose genome consists of the Ad5 
£coRl A (map positions 0 to 76), Ad2 £coRI D (map positions 76 to 83), and Ad5 EcoRI B 
(map positions 83 to 100) fragments (Wold et al., ViroL /^S: 168-188, 1986). rec700 is the 
parental virus of E3 mutants with 700 or 7000 numbers. The infected cells were treated with 
a monoclonal antibody to Fas, CH-1 U' which acts as an agonist of Fas and induces apoptosis. 
The cells were then fixed and stained for DN A and for the adenovirus DNA binding protein 
(DBF). Experimental details are provided in the foomote to Table I. 

Examples of apoptoiic and non-apoptotic nuclei in rcc700-infected cells arc shown in 
Figs. 6A and 6B. Most cells were infected as indicated by the speckled staining of DBF in 
the nucleus (Fig. 6A), and these nuclei were non-apoptotic (Fig. 6B). Two iminfected cells 
were apoptotic (arrows in Figs. 6 A and 6B) as evidenced by the presence of shrunken and 
irregular nuclei with condensed DNA that often fluoresced very brightly above the plane of 
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focus for non-apoptoitc nuclei. The percentage of apoptotic and non-apoptotic nuclei was 
scored in rec700- or mutant-infected cells staining for DBP and the quantitative results are 
shown in Table 1 below. 

5 Table I. Fas Agonist-induced Apoptosis in MCF7-Fas Cells Infected with Ad 

Mutants' 





1 

Ad DNA Binding Protein-Positive Cells* j 


Virus Mutant 


Apoptotic 


Non-apoptotic ; 


rec700 (wild type) 


0.1' 


99.9* ! 


pm760(ElB-19K'.RID') 


0.7 


99.3 i 


d/309 (EIB-19K*. RID-) 


0.1 


99.9 


rfn48 (E1B-19IC-.RID ") 


0.6 


99.4 


<f/764(ElB-19K*.RID ') 


0.2 


99.8 1 


lp5 (E1B-19K-. RID*) 


9.9 


90.2 i 


(tnSO (E1B-19K-. RID*) 


10.4 


S9.6 : 


1 dllW (E1B-19K-.RID-) 


S7.2 


12.8 1 


1 rf/118(ElB.19K*,RID-) 


1 94.1 


5.9 1 



*MCF7-Fas cells were infected with 250 PFU per cell of virus except for IpS, rf/250, dllWi 
and dl\ 18 where 10 PFU per cell was used. At 21 h post-infection (p.i.), cells were treated for 

10 22 h with the CH-ll agonist mAb to Fas (200 ng/ml) (Panvera, Madison, Wl) plus 
cycloheximide (25 ^lg/ml). Cells were fixed and stained for the Ad DNA binding protein 
(DBP) using a rabbit antiserum (obtained from Maurice Green, St. Louis University) and goat 
anti-rabbit IgG (fluorescein conjugate) and for DNA using 4, 6-diamidino-2-phenylindolc 
(DAPI). Typical apoptotic and non-apoptotic nuclei arc shown in Fig. 6B. which is from the 

15 same experiment. Nuclei of £//l 1 1- or c//l IS-intected cells not treated with Fas agonist were 
not apoptotic (not shown), indicating that the apoptosis observed was not due to the cyt deg 
phenoiype of ElB-19K-negativc mutants (Subramanian et al., ./. P'iroL 52:336-343, 
1984). 

20 "At least 1000 DBP-positive cells were counted per sample. 

^Percent of apoptotic and non-apoptotic nuclei in cells staining for DBP. 
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In cells infected with reclQO or mutant pml6Q, which expresses both EIB-19K and 
RID. very few nuclei were apoptotic. Cells infected with mutants expressing EIB-19K but 
lacking RIDa and E3-14.7K (<ff748), or lacking RIDp (J/764), or lacking each of RIDa, 
RID(3, and E3-14.7K (<f/309) also had very few apoptotic nuclei. However, only about 10% 

5 of cells infected with IpS and rf/250, which lack EIB-19K but express RID, had apoptotic 
nuclei, while about 90% of the nuclei were apoptotic in cells infected with dl\ 1 1 and dlWZ, 
which lack expression of RIDa, RIDp, E3 14,7 K and E1B.19K. These results indicate that • 
adenovirus has two proteins that independently inhibit Fas-induced apoptosis, RID and/or E3- 
14.7K in the E3 transcription unit and E1B-19K in the ElB transcription unit. This result 

1 0 observed with E 1 B- 1 9K is consistent with an earlier report (Hashimoto, S et al.. Int. 
ImmunoL i:343-35l, 1991. Data below show that RID inhibits Fas-induced apoptosis. 



Example 2 

This example illustrates that the RID complex is sufficient to inhibit apoptosis. 
15 To address whether RID is sufficient to inhibit Fas-induced apoptosis, plasmids 

expressing RIDa or RIDP from the Ad major late promoter plus SV40 enhancer were 
prepared by cloning the gene for RIDa or RIDp into the pMT2 vector (Mazzarella, R. A. & 
Green, M. 7. Biol. Chem. 262: 8875-8883, 1987) to generate pMT2-RIDa and pMT2-RIDp. 
MCF7.Fas ceils were transiently transfectcd with pMT2-RIDa plus pMT2-RIDp, pMTn- 
20 RIDp alone, or pMT2 alone (2.5 ^ig for each plasmid). After 38 h, cells were treated for 9 h 
with the CH-1 1 agonist mAb to Fas (500 ng/ml) plus cycloheximide (25 ng/ml), fixed in. 
methanol with DAPI, and stained for RIDp using the rabbit PI 18-132 antipeptide antiserum 
(Tollefson et al, Virology 1 75: 1 9-29, 1 990). 

Examples of apoptotic and non-apoptotic nuclei in the cells co-transfected with 
25 pMT2-RIDcLand pMT2-RIDp are shown in Figs. 6C and 6D. The cell transfectcd with RIDa 
plus RIDP (arrow in Fig. 6C) was non apoptotic (arrow in Fig. 6D). lUDP-negative cells 
usually had apoptotic nuclei (most cells in Fig. 6D). Of 2000 cells counted in random fields. 
173 RIDP^positive cells were seen, and only 26% of these had apoptotic nuclei. In the 
transfection with RIDP alone, and with 2000 cells counted, lOl RIDP-positive cells were 
30 seen, 80% of which had apoptotic nuclei. With pMT2 alone, 62% of the total nuclei were 
apoptotic. These results indicate that RID (i.e. RIDa plus RIDp), but not RIDp alone, is 
sufficient to inhibit Fas-induced apoptosis. 
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Example 3 

This example illustrates that RID down-regulates Fas from the cell-surface of 
adenovirus-infected human breast carcinoma cells. 

To investigate how RID inhibits apoptosis, MCF7-Fas cells were infected with 
5 adenovirus serotype 5 (Ad5), reclOO, or an Ad mutant lacking expression of one or more of 
RIDa, RIDp, and E3-14.7K proteins. At 28 h p.i., cells were detached using 0.025% EDTA. 
then resuspended in FACS buffer (IX PBS, 2% FBS). Approximately I X 10* cells were 
pelleted and resuspended in 50 \l\ FACS buffer containing antibodies against human Fas 
(UB2 IgG mAb) (Panvera) (10 \ig/m\), the human transferrin receptor 
i 0 (Boehnnger/Mannheim, Indianapolis, IN) (2.5 jig/ml) and purified mouse IgGy (PharMingen, 
San Diego, CA) (5 jig/ml) as an iso-type control. In common with Fas, the transferrin 
receptor is a cell surface receptor. Cells were incubated with the primary antibodies, washed 
with cold FACS buffer, incubated with 20 M-g/ml of goat anti-mouse FITC -conjugated 
antibody (ICN), washed, then analyzed on a FACScaliber flow cytomcter (Becton Dickinson, 
1 5 Mountain View, CA). The data were analyzed with Cell Quest software (Becton Dickinson) 
and are shown in Figure 7. 

Nearly all Fas (bold trace in Fig. 7) was cleared from cells infected with Ad5 or 
rcc700 (Figs. 7B, 7C). Transferrin receptor (dashed trace) was not affected. Fas was not 
cleared from cells infected with mutants lacking RIDa and/or RIDp« namely <f/309 (lacks 
20 RIDa, RIDP, E3-14.7K) (Fig. 7D), rf/748 (lacks RIDa) (Fig. 7E), and dPM (lacks RIDP) 
(Fig. 7F). Fas was down-regulated by rf/758 (RID-positive, lacks E3-14.7K) (Fig. 7G) and 
pmieo (overexpresses RIDa and RIDp) (Fig.7H). These results indicate that RID (i.e. RIDa 
and RIDP) is necessary to clear Fas from the surface of Ad-infected MCF7-Fas cells. Other 
Ad proteins, including E3-14.7K and EIB-19K, are not required. 

25 

Example 4 

This example illustrates that RED do^n-regulates Fas from the cell-surface of 
adenovirus-infected human lung adenocarcinoma cells. 

To determine if RID can remove Fas from the surface of other cell types, the human 
30 A549 cell line was examined. AS49 cells are derived from a human lung adenocarcinoma. 

A549 cells were mock-infected or infected with rec700. At 26 h p.i., cells were suspended in 
FACS buffer containing mouse IgGy, anti-human-Fas UB2 IgG monoclonal antibody 
(Panvera), or antibody against the human transferrin receptor (Boehnnger/Mannheim), 
incubated with goat anti-mouse fluorescein isothiocyanate (FrTC)-conjugated antibody, and 
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anaiyzed on a FACScaliber flow cytomcter using Cell Quest software (Becton Dickinson). 
The results are shown in Fig. 8. 

With mock-infected cells (Fig. 8B), there was strong staining for both Fas (the red 
trace in Fig. 8) and transferrin receptor (the blue trace in Fig. 8). With reclQO or pm760, a 

5 virus mutant that overexpresses RID (i.e., RIDa plus RIDP) and underexpresses other Ad E3 
protems. Fas was completely cleared from the cell surface whereas the transferrin recepior 
was not affected (Figs. 8C, 8H). With three virus mutants that lack both RIDa and RID6 
idl309), RIDP only (^f/764), or RIDa (if/748). Fas was not cleared from the cell surface (Fig. 
8, Panels E, F, and G). With cf/758, a mutant that lacks only E3-14.7K and that expresses 

10 RIDa and RIDP, Fas was down-regulated to the same extent as with reclOQ and pml60. 

Therefore, the E3-14.7K protein is not required to down-regulate cell surface Fas. Recently, 
RID was reported to clear Fas from the cell surface in two. other human ceil lines, HT-29. 14S 
andME-l80(ShisleretaUy. Virol. 77:8299-8306, 1997). These results have been comlrmed 
with HT-29. 14S and ME- 1 80 cells (data not shown). Thus, RID stimulates the removal of 

1 5 Fas from die cell surface of at least four different cell types, MCF7-Fas, A549, HT-29. US, 
and ME- 1 80 cells. 

Example S 

This example illustrates that Fas molecules removed from the cell surface by RID are 
20 internalized into vesicles and then degraded in iysosomes. 

Many receptors are internalized into endosomes. Accordingly, MCF7-Fas cells were 
mock-infected or infected with rec700 or with an E3 Ad mutant. MCF7 cells were mock- 
infected as a control. At 19 h p.i., cells were fixed in methanol and stained for Fas using the 
ZB4 mAb (Panvera) and goat anti-mouse IgG (Texas red conjugate). The results are shown 
25 in Figure 9. — • 

Fas was not detecfed in mock-infected parental MCF7 cells (Fig. 9A), but was readily 
apparent on the surface of MCF7-Fas cells (Fig. 9B). In cells infected with r€c700. Fas was 
in numerous vesicles and there was no cell surface staining (Fig. 9C). These vesicles are 
likely to be endosomes and Iysosomes containing Fas. These vesicles were not observed with 
30 <i/309, if/748, or dUM (lack RIDa and/or RIDp), whereas in each case, strong Fas staining 

was apparent at the plasma membrane (Figs 9D-9F). Vesicles staining for Fas were seen with 
dllSZ andpm760, both of which express RID (Figs. 9G, 9H). 

Some receptor types internalized into endosomes are targeted to Iysosomes where 
they are degraded. To determine whether Fas was degraded in Ad-infected cells expressing 
35 RID, MCFT-Fas cells were mock-infected or infected with wild-type Ad or an E3 mutant 
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lacking expression of one or more of RIDa, RIDp, and 14.7K proteins, then at 27 h p.i. 
proteins were extracted, separated by sodium dodecyl sulfate polyacrylamide gel 
electrophoresis (SDS-PAGE), and blotted onto an Immobilon-P membrane. After blocking, 
membranes were incubated with rabbit anti-Fas antiserum (Santa Cruz Biotechnology, Inc., 
5 Santa Cruz, C A), mouse anti-transferrin receptor mAb OKT9 ( ATCC), or mouse anti-E I A 
mAb M73. Membranes were incubated with the appropriate peroxidase-conjugated 
secondary antibody (ICN). Proteins were detected with ECL reagents (Amersham Life 
Sciences, Arlington Heights, IL) and the results are shown in Fig. 10. 

Fas was degraded in cells infected with virtises that express both RIDa and RIDP 
1 0 ( AdS, Ad2, rec700, <f/758, pm760) (Fig. lOA). Transferrin receptor was not degraded in these 
same extracts (Fig. lOB). Fas expression was actually stimulated in cells infected with 
mutants that lack RIDa and/or RID(J (Fig. 10 A, compare Mock with rf/309, (fr748, and 
rf/764). The Ad-coded El A proteins were expressed at similar levels (Fig. IOC), indicating 
that all infections were equivalent These and the above results establish that RID (i.e. RIDa 
1 5 and RIDp) functions in the internalization of Fas into putative endosomes, the degradation of 
Fas, and the inhibition of Fas-induced apoptosis. 

RID has been reported to stimulate the internalization of EGFR into vesicles and its 
degradation in lysosomes (Carlin et al.. Cell 57:135-144, 1989; ToUefson et al., J. Virol. 
65:3095-3105. 1991). When the epidermal growth factor receptor (EGFR) interacts with its 
20 ligand, EGF, EGFR is mtemaiized into early endosomes which are transported to late 

endosomes which fuse with lysosomes, where EGFR is degraded. This process results in 
attenuation of signal transduction through EGFR. Many receptors are degraded by the 
endosome-lysosome pathway in response to ligand. To determine if RID-induced degradation 
of Fas is occurring through this pathway, the following experiments were performed. 
25 The first experiment, which was described in the copending provisional application, 

examined Fas localization in COS cells transiently co-transfected with combinations of 
expression plasmids for Fas, RID a and RIDp. The following plasmids were used, the 
pMT2-RIDa and pMT2-RIDP plasmid vectors described in Example 2, and pcDNA3-Fas, 
which expresses Fas from the human cytomegalovirus promoter (CMV). C0S7 cells were 
30 transfected (Mazzarella, R. A. & Green. M. J. Biol. Chem, 262:8875-8883, 1987) with I ^ig 
each of pMT2-RIDa plus pcDNA3-Fas, pMT2-RIDP plus pcDNA3-Fas, or pMT2-RIDa, 
pMT2-RIDP, and pcDNA3-Fas. After 30 h, cells were fixed in methanol with DAPI and 
stained for Fas using the ZB4 mAb, for RIDa using the rabbit P77-91 antipeptide antiserum, 
or for RIDP using the rabbit PI 18-132 antipeptide antiserum (Tollefson et al., J. Virol 
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6-/:794.80l, 1990; Tollefson et al.. Virology 775:19-29, 1990). The results are shown in 
Figure II. 

With cells co-transfected with expression plasmids for RIDa plus Fas, or REDP plus 
Fas, Fas was localized on the cell surface (Fig. 1 IB, 1 ID). In contrast, with cells triple- 

5 transfected with expression plasmids for RIDa. RID(J, and Fas, Fas was in vesicles rather 

than the ceil surface (Fig, 1 IF. UH). RIDp staining was typical of the endoplasmic reticulum 
(ER) and plasma membrane, a probable site of RID action (Stewart et al., J, Virol. 69:172- 
l S 1 . 1 995): many vesicles containing RID(3 appeared to co-localize with vesicles containing 
Fas (arrows in Fig. 1 IG and 1 IH). Distribution to the ER was also characteristic of RIDa 

0 (Fig. I IE), and in some cells the plasma membrane was stained (not shown). RIDa did not 
co-localize with Fas-containing vesicles. Thus, RID (i.e. RIDa plus RIDP) is sufficient to 
internalize Fas into vesicles. 

In a second experiment. Fas localization was examined in Ad-infected cells. Human 
A549 cells were infected with recTOO fixed using 3.7% paraformaldehyde followed by 

15 methanol/DAPI (4,6-dianiidino-2-phenylindole). Cells were double-stained for Fas and 
LAMPl, which is a lysosomal protein (Carlsson et al.. J, BioL Chem. 15:18911-18919, 
1988), using a rabbit anti-Fas antibody (Santa Cruz Biotechnology) and the BB6 mouse anti- 
human-LAMP-1 monoclonal antibody (Carlsson et ah, supra), followed by goal anti-rabbit 
IgG-FITC and goat anti-mouse IgG-RITC (rhodamine isothiocyanatc) (Cappel ICN). Cells 

20 were examined using a Zeiss LSM 4 1 0 scanning laser confopal microscope with LSM 410 
software. The results are shown in Figure 12. 

Green, red, and yellow vesicles contain Fas (Fig. 12A), LAMPl (Fig. 123). or both 
Fas and LAMPl (Fig. 12C, 12D), respectively. The many yellow vesicles establish that Fas 
co-localizes with LAMPl in lysosomes. The Fas-containing green vesicles may be 

25 endosomes. Similar results were obtained with another lysosomal protein, CD63 (data not 
shown). 

To obtain additional evidence supportmg the involvement of the endosome-lysosome 
pathway in RID-induced Fas degradation in Ad-infected cells, the effect of Bafilomycin A I 
(Baf) treatment was investigated. Baf specifically inhibits the vacuolar-type iT-ATPase, 
30 preventing vesicle acidification and trafficking of receptors from late endosomes to lysosomes 
(Yoshimori etal.,y. BioL Chem. 266:17707-17712, 1991; van Weert etal, J. Cell. BioL 
130:821-834, 1995). A549 cells were mock-infected or infected with rec700 or rf/309 (lacks 
RID). At 13 h after infection, cells were treated with Baf (0.1 |iM) for 12 h and then 
immunostained for Fas. In a separate experiment, cells were treated with Baf at 6 h after 
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infection and processed for immunoblot analysis 18 h later. The results are shown in Figure 
13. 

When wild-type Ad-infected cells were treated with Baf, Fas was cleared from the 
cell surface but it accumulated in vesicles (Fig. 1 3 A) rather than being degraded as in 

5 untreated cells (Fig. 13B). Baf did not affect cell surface Fas in cells infected with a mutant 
lacking RID {^f/309) (Fig. 13C), Immunoblot analysis of proteins extracted from these cells 
indicated that Baf blocked the degradation of Fas m wild-type Ad-infected cells (Fig. 13D). 
Baf did not affect the abundance of Fas in mock-infected cells or m cells infected wi:h the 
EUD-minus mutant. Neither virus infection nor Baf affected the abundance of Erp72 (Fig. 

10 1 3D), a cellular protein localized in the endoplasmic reticulum (Mazzarella et al., 1990). 
Also, neither virus infection nor Baf significantly affected the level of another cellular 
protem, the transferrin receptor (Fig. 13E). The infections were equivalent as indicated by the 
E1B-19K levels of the Ad-encoded protein (Fig. 13D), These confocal microscopy and Baf 
data provide strong evidence that RID causes Fas to be degraded in lysosomes in Ad-infected 

15 cells. 

Example 6 

. This example illustrates that the RID proteins are sufficient to promote the 
degradation of Fas. 

20 COS cells were transiently transfected with different combinations of pMT2-RIDa, 

MT2-RIDP, pcDNA3-Fas, and pBUC-Shp-l, which expresses a mammalian cell protein 
named Shp-1 . At 36 h post-transfection, cells were treated with cycloheximide (25 p-g/ml) for 
1 2 h and at 48 h post-transfection, proteins were extracted and analyzed for Fas, Shp- 1 , or 
ERp72 by immunoblot using rabbit antisera to Fas (Santa Cruz), Erp72 (Mazzarella et al., 

25 1990), or Shprl (Plas et al., 1996) (Tollefson et al.. Nature 392:726-730 (1998)). Tne results 
are shown in Fig. 14. 

In cells transfected with pcDNA3-Fas and/or pBUC-Shp-1, expression of Fas and/or 
Shp-1 proteins was readily detected by inununoblot (Fig. 14, lanes b-d). For Fas, two 
groupings of bands were detected (indicated by the arrows), which represent differentially 

30 glycosylated species of Fas. The anti-Fas antibody also reacted with an unknown cellular 
protein that migrated between the two sets of Fas protein bands. When pMT2-RIDa or 
pMT2-RIDP were co-transfected with pcDNA3-Fas and pBUC-Shp-l, there was a marginal 
decrease in Fas and Shp- 1 (Fig. 14, lanes e and f). However, when both pMT2-RIDa and 
pMT2-RID() were co-transfected with pcDNA3-Fas and pBUC-Shp-1, the Fas bands were 

35 reduced to nearly undetectable levels, whereas the Shp- 1 band was only marginally decreased 
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(Fig. 14, lane g). The levels of the endogenous cellular protein, Erp72, were equivalent in all 
of the transfected cells. These results indicate that the RID complex (i.e. RIDa plus RDDP), 
but not RIDa or RIDp alone, is sufficient to induce degradation of Fas. 

A similar experiment was conducted except that cells were transfected with the 

5 pcDNA3 . 1 -CAT (In Vitrogen, Carlsbad, C A) plasmid expressing chloramphenicol acetyl 

n-ansferase (CAT) instead of pBUC-Shp-1 . Since CAT is a bacterial protein, it is not possible 
for RID to have evolved in Ad to exert a specific biological effect on CAT. Expression of 
this protein was detected by immunoblot using anti-CAT antisemm obtained from 5 prime-3 
prime. The results of the experiment were similar to those with Shp-l, i.e. Fas was greatly 

10 reduced in the presence of RID, whereas CAT was only marginally affected (Fig. 15). 

These experiments demonstrate that the RID complex is sufficient to induce the 
internalization of cell-surface Fas into vesicles, presumably endosomes and iysosomes, to 
induce degradation of Fas, presumably in Iysosomes. and to inhibit apoptosis triggered by an 
anti-Fas agonist monoclonal antibody. 

15 

Example 7 

This example illustrates that RID inhibits killing of Ad-infected cells by natural killer 
cells and cytotoxic lymphocytes. 

Natural killer (NK) cells and cytotoxic T-lymphocytes (CTL) play an important role 
20 in the destruction of virtis-infected cells during the early innate phase and the late immune- 
specific stages,»respectively, of the host anti-viral response. Both NK and CTL kill targets via 
two major pathways. In one major pathway, perforin generates holes in the target and 
granzymes are introduced to induce apoptosis of the target cell. In another major pathway. 
Fas ligand on the surface of the CTL engages Fas on the target cell and induces apoptosis 
25 through activation of the pro-apoptotic caspases. CTL can also kill cells through a third 

minor pathway, in which TNF expressed on the surface of CTL (or secreted by CTL) engages 
TNFRl on targets and induces apoptosis via the caspases. In cell culture, TNF-mediated 
killing by CTL is observable in long tenm (> 24 h) killing assays. To investigate whether RID 
inhibits NK- and CTL-killing through Fas, the following experiments were conducted. 
30 In the first experiment, which was described in the copending provisional application, 

the effect of Ad proteins on CTL-killing was assessed by performing a shon-term CD3- 
dependent redirected cell assay (Azuma et al., J. Exp. A/c^./ 75:353-360, 1992), using 
lymphocytes from perforin (-/-) mice (Kagi et al.. Science 265:528-530, 1994) and from wild- 
type perforin (+/+) C57BLy6J mice acutely infected with influenza virus. Influenza virus 
35 enhances the expression of Fas ligand in activated lymphocytes (Clark et al., Immunol. Rev. 
I ^(5:33-44. 1995). In brief, mice were primed by intranasal infection of 50 HAU of HkX3 1 
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influenza A virus (Topham ct al., J, Virol 7^:1288-1291, 1996; Tripp et al., J. Immunol, 
1 54:60 1 3-602 1 , 1 995). CTL were isolated from the spleens of the infected mice, irradiated, 
and effector CTL generated by secondary in vitro re-stimulation. These CTL were further 
activated by incubation with the 145-2CI 1 anti.CD3e mAb for 30 min on ice. Mouse Fas and 
5 Fc receptor-positive P815 ceils (I X 10*) were mock-infected or infected with 1000 PFU per 
cell of recTOO or <f/700l and labeled overnight with 100 ^Ci of Na2**Cr04. These ^*Cr- 
labeled P8 1 5 target cells were washed, resuspended in DME, and then incubated with the 
activated anti-CD3e-n-eated CTL using effector lymphocytertarget ratios of 60:1, 20:1 or 6:1. 
Cell lysis was deicrmmcd 6 h later from a standard ^*Cr release assay and the results are 
10 shown in Figs. 16A and 16B. The presence of Fas on the surface of P815 cells infected with 
reel 00 or dllOOO was also examined by flow cytometry and the results are shown in Fig. 16C. 

The perfonn (-/•) CTL lysed mock-infected P815 cells efficicndy (Fig. 16 A). Lysis 
was inhibited by recTOO but not by dllOOl (lacks all E3 genes). Since the mice lack perforin. 
It follows that the CTL were killing the mock- and mutant-infected cells through the Fas 
1 5 pathway and that the E3 region is required to inhibit killing through this pathway. The CTL 
from perfonn (+/-»-) mice killed mock-, rec700-. or ^nOOl-infected P815 cells with similar 
high efficiency (Fig. 16B). Cell surface Fas was diminished on P81S ceils infected with 
reclOO but not with rf/7000 (lacks all E3 genes excqst for E3-14.7K) (Fig. 16Q. These 
results indicate that E3 proteins expressed by recTOO but not ^000, presumably. RID, inhibit 
20 CTL killing through the Fas pathway by down-regulating Fas from the cell surface. 

A seco.nd experiment was conducted to investigate the role of RID in inhibiting 
killing of Ad-infected cells by NK cells. Human AS49 cells were mock-infected or infected 
with recTOO (wild-type Ad) or rf/764. a virus mutant that lacks only RID3 and then labeled 
with 100 \xCi of Naj^'CrO^. These ^*Cr-labeled A549 target cells were washed, resuspended 
25 in DME, an.dJthen incubated with a semi-permanent line of human NK cells. After 24 h. cell 
lysis was measured based on release of ^'Cr from the cells as described elsewhere (Tollefson 
et al.. Nature 392:726-730 (1998)) and the results are shown in Figure 17. 

Mock-infected cells were lysed efficiently at NIC:AS49 cell ratios of 10:1 and 5:1 
(Fig. 17). This lysis was dramatically inhibited by infection with rec700, but it was only 
30 marginally reduced by infection with dllM (Fig. 17). Since the only protein not expressed by 
^/764 is RID(3, it is believed that RID is required to inhibit killing of Ad-infected cells by NK 
cells. Most likely RID inhibits killing by NK cells by blocking the Fas pathway. However, a 
RID effect on the perforin-granzyme pathway cannot be excluded. 

In summary. RID inhibits killing of Ad-infected cells by NK cells and by CTL. Thus, 
35 RID should protect infected cells from attack by killer cells that are active in both the early 
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innate phase and the late immune-specific phase of the anti-viral immune response. 
Similarly, transplanted cells and tissues are destroyed by NK cells and CTL. Therefore, RID 
should be useful to inhibit killing of transplanted cells or tissues by NK cells and CTL. 

5 Example 8 

This example illustrates that RID is required and probably sufficient to remove the 
TNFRI from the cell surface. 

Human HeLa cells were mock-infected or infected with 50 PFU/cell of rec700 (wild- 
type) or dim, which is a rec700-derived mutant with a deletion m the adp gene in the E3 
10 region that results in overexpression of both RID (i.e. RIDa and RIDP) and E3-14.7K, and 
only trace amounts of other E3 proteins (Tollefson et al., J. ViroL 64,794-801, 1990; 
ToUefson et al., ViroL 175:19-29. 190; Gooding ct aL, Cc// 53 :341o46, 1988). At26hp.i., 
cells were analyzed by flow cytometry (Tollefson et al.. Nature 392:726-730 (1998)) using 
the B/0:2/18/91 rabbit antiserum against TNFRI (obtained from Immunex Corp.) and PE- 
1 5 conjugated goat anti-rabbit IgG (Caltag). Fas was detected in the same experiment using 
supematants from the M38 anti-Fas hybridoma cell line (obtained from the American Type 
Culture Collection) and Fi rC-conjugated goat anti-mouse IgG. The results are shown in 
Figure 18. 

As shown in Fig. 1 8 A, TNFRI was removed from the surface of most cells infected 
20 with reclOO (red trace) or rf/712 (blue trace). The percentage of mock-infected cells th^t were 
stained for TNFRI was 93%, as compared to 16% and 18%, respectively, forrec7G0 and 
^/712. In this same experiment, cell surface Fas was also internalized by rec700 and J/712 
(Fig. 18B). Thus, Ad infection removes .TNFRI from the cell surface, as is the case with Fas. 
The mutant used in the above expenment/dr712, overexpresses RID and E3-14.7K, 
2S and expresses very little of the <^ec E3 proteins. To determine whether RID and/or E3 14.K 
is involved in internalization ofSNFRl in Ad-infected HeLa cells, the same experiment was 
perfonned using rf/712 and additional E3 mutants: rf/309. which lacks RIDa, RIDp, and E3- 
14.7K; rf/753, which lacks RIDa but expresses RIDP and E3-14.7K; and d/764, which lacks 
RIDP but expresses RIDa and E3-14.7K. The deletions in these mutants do not affect 
30 expression of any other Ad proteins. The results are shown in Figure 19. 

With rec700 and rfr712, TNFRI was removed from the cell surface such that only 
29% and 24%, respectively, of cells were stained for TNFRI as compared to 92% with mock- 
infected cells (Fig. 19A). With rf/309, rff753, and dHM infected cells, 84%, 85%, and 84%, 
respectively,«were stained for TNFRI, indicating that these mutants did not induce removal of 
35 TNFRI from the cell surface. Cell surface Fas was also examined in this same experiment. 
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reclOO and dU 12 cleared Fas whereas rf/753, and dllM did not (Fig. 19B). Tnus, RID 

is required to remove TNFRl from the surface of Ad-infected cells, as is the case with Fas. 

As a means to determine whether RID is sufficient to remove TNFRl from the cell 
surface, HcLa cells were infected with the Ad vector named 23 1-IO. This vector will be 
5 described in detail in Example 10 below. In brief. 23 1-10 lacks the El A. ElB, and E3 
transcription units. The deleted El A plus ElB regions are replaced with an expression 
cassette wherein all the E3 proteins are expressed from the human cytomegalovirus (CMV) 
promoter. Because 23 1-10 lacks El A, viral genes in the vector backbone are not expressed; 
only the E3 proteins are expressed from the CMV promoter. Thus, the vector serves as an 
10 essentially inert vehicle by which E3 genes can be delivered into cells and the properties of 
their proteins studied. 

HeLa cells were mock-infected or infected with the 23 1- 10 vector, and cell surface 
TNFRl was examined by tlow cytometry at 24 h and 48 h p.i. as described above. .At 24 h 
p.i., the percentage of cells bearing TNFRl was reduced from 93% to 35%, and by 48 h the 
1 5 percentage was reduced to 1 1 % (Fig. 20). This time course of TNFR I down-regulation i 
correlates with expression of the E3 proteins. In a parallel experiment. Fas was nearly 
completely cleared by 24, 36, and 48 h p.i. (data not shown). Thiis, TNFRl and Fas are 
removed from the cell surface by the E3 proteins expressed by 231-10. RID is undoubtedly 
the E3 protein responsible for the removal of these death receptors. 
20 The ability of Ad and the RID protein to remove TNFRl from the cell surface was 

examined using the biotin-streptavidin system (Stewart et aL. 1995) to detect TNFRl. 
Multiple dishes of A549 cells were mock-infected or infected with 50 PFU/cell of rec700 
(wild-type).. At 16 h p.i., cell surface proteins in mock- and Ad-infected cells were labeled 
using biotin. Ad-infected cells in other dishes were also labeled with bioiin at 18, 20, 22, 24, 
25 and 30 h p.i. Proteins were extracted using buffer containing 0.5% NP-40, and were 

incubated with protein A-Sepharose CL-4B attached to the B/0:2/l8/9l rabbit antiserum 
against TNFRl . After washing, proteins were solubtlized, subjected to SDS-PAGE, and 
transferred to membranes. Membranes were incubated with peroxidase-conjugated 
streptavidin (Sigma), and proteins were visualized using ECL (Amersham). 
30 In this assay, if Ad infection has resulted m the removal of TNFRl from the cell 

surface, then TNFRl will not be available for biotinylation and therefore TNFRl will not be 
detected. As shown in Fig; 21, similar amounts of TNFRl were obtained from mock- or 
recTOO-infected cells at 16 h p.i. With rec700, TNFRl declined from 18 to 30 h p.i. until only 
small amounts were detected. Thus, as was the case when TNFRl was detected by flow 
3 5 cytometry. Ad infection results in markedly decreased amounts of cell surface TNFRl . 
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The ability of the 231-10 Ad vector to down-regulate ceil surface TNFRl as 
determined with the biotin-streptavidin assay was also examined. As discussed above, 23 1- 
10 expresses only Ad E3 proteins. Cells were mock-infected, infected with 50 PFU/cell of 
reclOO (wild-type), or infected with 250 PFU/cell of 231-10. At different days p.i.. cells were 

5 biotinylated and TNFRl detected as described above. As expected, most of the TNTRl was 
cleared by reclQO at I day p.i. (Fig. 22 A, compare lanes a and b). With 23 1-IO. reduced 
amounts of TNFRl were delected by I day p.i., and by 5 days p.i. the TKFRl levels declined 
to those of rec700. Tne levels of TNFRl in mock-infected cells were similar after 5 days to 
those after I day (Fig. 22A, compare lane h with lane a). Therefore, the reduction at 5 days 

10 seen with 231-10 is not due to a non-viral event associated with maintaining the cells in 

dishes for 5 days. These results indicate that the E3 proteins expressed by the 23 1-IO vector, 
presumably RID, are sufficient to clear TNFRl from the cell surface. 

The accumulation of RIDp in these same cell extracts was also examined by standard 
immunoblot using the rabbit PI 18-132 antiserum (Stewart et al., 1995). With recTOO, RIDp 

1 5 was abundant after 1 day (Fig. 22B, lane b). The multiple bands on RIDP are species of 
RID{3 that are differentially O-glycosylated and phosphorylated. With 23 1 -10, RIDp was 
detected after 2 days, and it increased dramatically in abundance from days 3-5 (Fig. 22B, 
lanes c-g). Therefore, as expected, the accumulation of RIDP in this experiment correlated 
inversely with the decline in cell-siorface TNFRl . 

20 These results obtained using the B/0:2/18/9l antibody in the biotin-streptavidin and 

flow cytometry assays to detect TNFRl are consistent. Thus, it is believed that RID is 
« 

necessary to efficiently down-regulate cell surface TNFRl in Ad-infected cells. The results 
with 231-10 indicate that RID is sufficient to down-regulate TNFRl, with the caveat that the 
E3 14.7K and gpl9K proteins, and possibly the E3 12.5K and 6.7K proteins, arc expressed by 
25 231-10. ... 

To determine if RID is responsible for clearance of cell-stirface TNFRl, the 
following Ad E3 mutants were used: dUAS, which overexpresses RIDP but lacks RIDa; and 
dn9i, which overexpresses RIDa but lacks RIDp. A549 cells were mock-infected or 
infected with 50 PFU/cell of rec700, d/748, or J/798, or infected with 25 PFU/cell each of 
30 dn^% and rf/798. At 26 h p.i. cells were biotinylated and TNFRl examined as described 

above. As a positive control, a dish of mock-infected cells was treated with TNF. and the cell 
extract was examined for TNFRl . As expected, TNF removed most of the TNFRl from the 
cell surface (Fig. 23 A. lanes a and b). 

The results with the viruses are shown in Fig. 23 A, lanes c-f. With rec700 (wild- 
35 type)-infected cells, only small amounts of TNFRl were detected (lane c). With rf/748 
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(EaOa-, RJDp^ and ^ff798 (RIDa% RID|J-), high to intermediate levels of TNFRl were 
observed, indicating that RIDa and RIDp arc required for efficient clearance of TNFRl. 
When cells were co-infected with dJlAS and if/798, TNFRl was reduced to levels comparable 
to rec700-infected cells (lanes f and c). This result indicates that the mutants complement 
{^f/748 provides RTOP, ^/798 provides RIDa), and that both RIDa and RIDp are required for 
efficient removal of TNFRl from the cell surface. Figure 23B shows a standard immunoblot 
for E1B-19K from the same extracts that were analyzed for biotinylated TNFRl. Similar 
amounts of E1B-19K were detected with all viruses. Therefore, differences in TNFRl levels 
seen with these viruses are not due to differences in infection efficiency by the viruses. 

The partial clearance of TNFRl observed with these RIDa' and RIDP" mutants is 
consistent with the flow cytometry data in Fig. 19. These results suggest that there may be a 
mechanism in addition to RID that down-regulates cell-surface TNFRl in Ad-infected cells. 
However, clearly, most of the down-regulation of TNFRl requires RID. 

In summary, RID is required to remove TNFRl from the surface of Ad-infected cells. 
1 5 RID is also sufficient for removal of TNFRl as indicated by the experiment with the 23 1 - 1 0 
vector, with the caveat that the 23 1-IO vector also expresses other E3 proteins. RID 
expressed by the 231-10 vector is also sufficient to remove Fas from the cell surface, again, 
with the same caveat However, the down-regulation of TNFRl and Fas by 231-10 is almost 
certainly due to RID, because the mutant mapping data with E3 mutants have provided no 
20 indication that other E3 proteins play any role in down-regulating these death receptors. 
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Example 9 

This example demonstrates that the 23 1- 10 vector prevents rejection of human cancer 
cells transplanted into immunocompetent mice. 

Cells or tissues transplanted into immunocompetent recipients are usually destroyed 
(rejected) by immune killer cells of the recipient. Rejection begins within 1-2 days, and 
therefore is mediated by the irmate inmiune system including macrophages and NK cells. 
Specific CTL formed after about 5-7 days also play a major role in transplant rejection. As 
discussed above in Example 7, RID inhibits NK- and CTL-killing of Ad-infected cells and 
thus should also be able to inhibit NK- and CTL-mediated rejection of transplanted cells or 
tissues. 

This idea was tested by determining whether the E3 proteins expressed by the 231-10 
vector will permit human cancer A549 cells to grow as a tumor in inmiunocompetent 
CS7Biy6 mice. Human cancer cells normally will be rejected when transplanted in 
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C57BL/6 mice. However, RID should inhibit rejection by removing Fas and TNFRl from the 
transplanted cells. E3-14.7K may also prevent rejection. 

A549 cells mock-infected or infected with 50 PFU/ccll of 23 1 -10. After 48 h, 2 x 10* 
cells (in 100 were injected subcuianeously into each hind limb flank of female C57BL/6 
5 mice. At 18 days post-injection, the mice were sacrificed and the site of injection was 

exammed following removal of the skin. With mice that received mock-infected cells, there 
was a pin-point mass on one flank, and no mass at all on the other flank (data not shown). 
With the 23 l-lO-infected cells, there were significant tumor masses on both flanks (Fig. 24). 
The tumors were opaque and ellipsoid in shape. The left-flank tumor was attached to the 
10 muscle. The right-flank tumor, which is shown in higher magnification in Fig. 25, was 

attached to both the muscle and skin. The size of the tumor obtained with 23 l-lO-infected 
cells was many times larger than what would be observed from the initial bolus of cells 
injected (2x10* cells are barely visible to the naked eye). Thus, the cells grew into a tumor. 
In the second experiment, mock-infected and 23 1-10-infected A549 cells (at 2 days 
1 5 p.i. in culture, 50 PFU/cell) were used, both live cells as well as cells that were killed by 

freezing and thawing. These cells were injected into each hind limb of C57BL/6 and Balb/c 
mice, 2 x 10^ cells per injection. As is the case with C57BL/6, the Balb/c mice are fully 
immunocompetent. There were four mice of each strain. Mouse 1 received killed uninfected 
AS49 cells, mouse 2 received live A549 cells, mouse 3 received killed 23 l-lO-infected cells, 
20 and mouse 4 received live 23l*10-infected cells. Mice were harvested at 15 days following 
injection. No tumors were observed in either mouse strain with killed cells. With the 
C57BL/6 mouse that received uninfected live ceils, there was no growth on one flank and a 
very small mass on the other flank. With the Balb/c mouse that received live uninfected cells, 
there were small flat masses on each flank. However, with both the CS7BL/6 and the Balb/c 
25 mouse that received 23 l-lO-infected ceils, there were much larger eiipsoid masses (tumors) 
on both hihid flanks. These tumors resembled the tumors shown in Figs. 24 and 25. 
Therefore, as was the case in the fu-st experiment, the 231-10 vector allowed A549 cells to 
form tumors in immunocompetent mice. 

One of the 23 1 -infected cell tumors from the C57BL/6 mouse was examined for 
30 expression of the E3 proteins known to be synthesized in cultured cells by 231-10. Proteins 
were extracted from the tumor, and the EUDp, 14.7K, and gpl9K proteins assayed by 
immunoblol. As shown in Fig. 26, all three proteins were detected. This result provides very 
sn-ong evidence that the cells onginally infected with 23 1-IO. at the very minimum, persisted 
in the mouse. It is very likely that these cells grew as well, considering that tumors were 
35 fonned. It is not likely that the 23 1- 10 vector replicated in these cells, because the vector 
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lacks the El A gene. Most likely, as the A549 ceils proliferated in the mouse, a portion of the 
input vector was segregated into the daughter cells. 

In summary, the E3 proiems expressed from the 23 1- 10 vector have permitted the 
grovrih of human A549 cancer cells to form tumors m C57BL/6 and Balb/c mice. The tumors 
5 would not have been able to form unless they were protected from destruction by the immune 
system. Tnese results argue strongly that the E3 protems should prevent immune rejection of 
other r\pes of transplanted cells and tissues. Thus, the 23 1 -10 vector has the potential to be 
used m tissue or cell transplants to prevent rejection of the tissues or cells. 

Example 10 

1 0 This example illustrates the construction and propenies of the 23 1 -10 vector. 

Feapjres of 23U1Q 

Tne 23 1 -10 vector is a human adenovirus serotype 5 (Ad5) vector. It can be viewed 
as a ''transient tnuisfection" system, analogous to that obtained when a plasmid expression 
vector is transfected into cells. The basic features of the 23 1-IO vector are outlined in the 
1 5 schematic shown in Fig. 27 and the ennre DNA sequence of the genome of 231-10 is given in 
Fig. 2S. 

The honzoniai bar in Fig. 27 depicts the linear double-stranded DNA genome. The 
base pairs (nucleotides) are numbered from I to 34427 (see Fig. 28), firom left to right in Fig. 
27. Nucleotides 342-3S23 are deleted, removing all the genes in the Ad ElA and ElB 
20 transcription units (coUecuvely known as El). Nucleotides 28133-30818 are also deleted, 

removing all the genes in the E3 transcription unit. In place of El, an expression cassette has 
been inserted, in which the E3 genes are expressed from the hizman cytomegalovirus 
immediate early promoter-enhancer (CMV). This E3 expression cassette contains the E3 
genes from the virus named pm734.l, which is a derivative of the virus named reclQO 
25 (Tollefson et al., Virol. 220: 1 52- 162. 1 996). rec700 is an Ad5-Ad2-Ad recombinant that has 
the Ad2 version of the E3 genes for the 12.5K. 6.7K, gpl9K, and RIDa proteins, and the Ad5 
version of the E3 genes for the RIDP and 14.7K proteins. The E3 cassette in 231-10 contains 
all the E3 genes from p/n734.1 . Notably, there are two missense mutations in the adp gene 
(which encodes the Adenovirus Death Protein [ADP], previously named E3-1 1.6K) 
30 (Tollefson et al., supra). These two mutations eliminate the first two methionine codons in 
the adp gene, thereby precluding synthesis of functional ADP (Tollefson et al., supra). 

The 23 1 -10 vector was designed to have the following properties. First, since the 
El A genes are lacking, the vector should not replicate (efficiently) on most cell lines. 
Therefore, Ad early and late proteins will not be expressed and Ad DNA will not replicate. 
35 (It is known that Ad mutants lacking ElA do replicate their DNA and express late proteins at 
low levels when high multiplicines of infection are used and the infection is allowed to 
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proceed for several days. This is also true for 231-10 [not shown].) Second, the E3 proteins 
should be expressed in an El A- independent manner from the CMV promoter/enhancer. 
Thus, 23 1-IO is an essentially inert vehicle that can deliver the Ad E3 proteins into cells 
vvuhout havmg other Ad proteins expressed, at least for the first approximately 3 days 
5 following mfection. Even after 3 days, other Ad proteins should be expressed only in very 
small amounts, much less than the E3 proteins. 
Construction of Ad 231«1Q 

(a) The genes of the E3 transcription unit were excised from pm734.l (pm734 J is 
reclOO with mutations of the Met I and Met41 codons m the adp gene. recTOO is the same as 

1 0 .\d5 but with the Ad2 EcoRI-D fragment substituted for the corresponding Ad5 EcoRI-C 
fragment). The/7/n734.l Sr£I-NdeI-D fragment (3560 bp) was blunt-end using the Klenow 
enzyme and cloned into the Smal site of the pBluescriptSK(+) vector (Stratagene), resulting 
in plasmid pl72l which has the whole E3 transcription unit of pm734.1 (-39 to 3521) flanked 
by Sall-BstXI-SacII-Notl-Xbal-Spel-BamHI sites situated upstream from the E3 sequences 

1 5 and Pstl-EcoRI-EcoRV-Hindlll-Clal-Sall-XhoI sites situated downstream from the E3 
sequences. 

(b) ITie BamHI-Sall-A fragment (3605 bp) of p 172 1 was subcloned between the 
BanaHI-XhoI sites of plasmid pCDNA3.1zeo(+) (Invitrogen), resulting in plasmid pi 81 in 
which E3 genes are under control of the CMV promoter-enhancer. 

20 (c) The Mfel-Clal fragment of p 1 8 1 (4328 bp), corresponding to the CMV promoter- 

E3 genes from thepm734.1 expression casseUe, was subcloned between the EcoRJ-Clal sites 
of plasmid pAElsplA (Microbix Biosystems Inc., Toronto), resulting in plasmid p23 1 which 
has the CMV-E3 expression cassette flanked by Ad5 genomic sequences (Ad5 map units O-l 
and 9.8-16,1). The orientation of the CMV.E3 expression cassette is right-to-left (opposite to 
25 the Ad E I and major late transcription units). 

(d) Plasmid p23l was cotransfected along with plasmid pBHGlO (Microbix 
Biosystems Inc., Toronto) into 293 cells resulting in plaques of recombinant virus 23 1-IO. 
The virus has deletions of El (Ad5 nt 342-3523) and E3 (Ad5 nt 28133-30818), and has the 
CMV-E3 expression cassette in place of the El deletion. 
30 The 23 MO Vector Expresses the E3 RID. 14.7K. and gDl9K proteins . 

The E3 proteins arc expected to be s>'nthcsi2ed from the E3 expression cassette in 
23 1 -10. To demonstrate that this is so, separate dishes of A549 cells were infected with 250 
PFU/ccll of 231-10, then at 0-5 days p.i. protein extracts were examined for the E3 RID, 
14.7K, and gpl9K proteins using standard immunobloi procedures (Tollefson et al.. Nature 
35 392:726-730 (1998)). In one dish, 231-lO.infected cells were treated with l-fJ-D- 
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arabinofuransylcytosine (araC) at 2 h p.i., then proteins were extracted at I day p.t. RID(3, 
14.7K, and gpl9K were readily detected at 2 days p.i., and their abundance increased until the 
end of the experiment at 5 days p.i. (Fig. 29, lanes d-g). On longer exposures of the gel 
shown in Fig. 29, a trace of RIDP, 14.7K, and gpl9K can be seen at I day p.i. (not shown). 
5 In the experiment shown in Fig, 29, one dish of cells was treated with araC. AraC 

inhibits Ad DNA replication, and therefore Ad late genes cannot be expressed. As shown in 
Fig. 29, small amounts of RIDP and gpl9K were detected in the araC-trcated cells; 14.7K was 
also detected in longer exposures of the gel (lane A). Therefore, as expected, E3 proteins are 
synthesized by 23 1- 10 without replication of the vector Ad DNA. 
1 0 These results demonstrate that the RIDP, 14.7K, and gp 1 9K proteins are expressed in 

23 1 - 10-infected cells. In another experiment, the levels of RIDp at 4 or 5 days p.i. were 
roughly similar to those of rec700-infected cells at I day p.i. (see Fig. 22). Bearing in mind 
that reclOQ has replicated by I day p.i. and therefore has expressed higher levels of RIDP 
from more templates, the quantities of RIDP, 14.7K, and gpl9K observed with 231-10, which 
1 5 does not replicate (or only replicates in small amounts at 4 or 5 days p.i.). are quite high. The 
synthesis of the E3 12.5K and 6.7K proteins by 23 1-IO has not been examined. Although not 
shown directly in Fig. 29, the RIDa polypeptide is also expressed by 231-10. This can be 
deduced from the observation that 231-10 exhibits the expected functions of RID, namely it 
clears Fas and TNFRl from the surface of infected cells (see Example 8.). These functions 
20 require both RIDa and RIDp. ^ 

Indirect immunofluorescence was also used to study the expression of the gpl9K, 
RIDP, and 14.7K proteins in A549 cells infected with 231-10, At 2 days p.i., the gpl9K and 
RIDP proteins were visualized as described previously (ToUefson et al.. Nature 392:726-730 
(1998); Hermiston et al.,/ ViroL 67:5289-5298 (1993)) and the 14.7K protein was stained 
25 using a rabbit antiserum directed against a TrpE- 14.7K fusion protein (ToUefson and Wold, 7. 
Kfro/.62:33-39 (1988)). Strong staining of gpl9K was observed in a panem typical of the 
endoplasmic reticulum (Fig. 30A), as has been observed with rec700 (Hermiston et al., 
supra). The pattern for RIDP was also similar to that seen wuh rec700, i.e. staining of the 
Golgi, other membranes, and the plasma membrane (Fig. 30B: ToUefson et al.. Nature 
30 392:726-730 (1998)). The 14.7K protein staining was diffuse in the cytoplasm (Fig. 30C), 
which again is typical of rec700 (unpublished results). These results establish that the E3 
gpl9K, RID, and 14.7K proteins localize to the same or similar intracellular compartments as 
they do in wild-type Ad-infected ceils. 

In view of the above, it will be seen that the several advantages of the invention are 
35 achieved and other advantageous results attained. 
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As various changes could be made in the above methods and compositions without 
departing from the scope of the invention, it is intended that all maner contained in the above 
description and shown in the accompanying drawings shall be interpreted as illustrative and 
not in a limiting sense. 

All references cited in this specification are hereby incorporated by reference. The 
discussion of references herein is intended merely to summarize the assenions made by their 
authors and no admission is made that any reference constitutes prior art. Applicants reserve 
the right to challenge the accuracy and peninency of the cited references. 
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What is Qaimed is: 

1 . A method for inhibiting apoptosis of a cell comprising treating the cell with an 
effective amount of a Receptor Internalization and Degradation (RID) complex. 

2. The method of claim 1 wherein the treating step comprises administering to the cell 
a polynucleotide encoding the RID complex and wherein the RID complex is expressed in the 
cell. 

3. The method of claim 2 wherein the polynucleotide comprises a recombinant 
adenovirus vector. 

4. The method of claim 3 wherein the recombinant adenovirus vector is 231-10. 

5. The method of claim 3 wherein the cell expresses Fas, TNFR-l, DR3, TR-\IL-R1, 
or TRAIL-R2. 

6. The method of claim 5 wherein the cell is a leukocyte. 

7. The method of claim 5 wherein the cell comprises a transplant tissue. 

8. The method of claim I wherein the treating step comprises administenng the RID 
complex to the cell. i 

9. The method of claim 8 wherein the RID complex is administered with a carrier 
which facilitates delivery of the RED complex into the cell. 

10. A method for decreasing apoptosis of target cells in a patient comprising treating 
the patient with an effective amount of a Receptor Internalization and Degradation (RID) 
complex. 

1 1. The method of claim 10 wherein the treating step comprises administering to the 
patient a polynucleotide encoding the RID complex and wherein the polynucleotide is 
internalized in the target cells and the RID complex is expressed. 

12. The method of claim 1 1 wherein the polynucleotide comprises a recombinant 
adenovirus vector. 

1 3. The method of claim 1 2 wherein the recombinant adenovirus vector is 23 1 - 1 0 . 

14. The method of claim 10 wherein the patient suffers from a degenerative disease or 
an immunodeficiency disease. 

15. The method of claim 10 wherein the treating step comprises administering the 
RID complex to the patient. 

16. The method of claim 15 wherem the RID complex is administered with a carrier 
which facilitates delivery of the RID complex mto the cells. 

17. A method for decreasmg leukocyte apoptosis in a patient comprising: 

(1) withdrawing leukocytes from the patient, 

(2) treatmg the leukocytes with an effective amount of a RID complex, and 

(3) administering the treated leukocytes to the patient. 
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18. The meihod of claim 17 wherein the treating step comprises administenng to the 
leukocytes a polynucleotide encoding the RID complex wherein the RID complex is 
expressed in the leukocytes. 

19. The method of claim 18 wherein the polynucleotide comprises a recombinant 
adenovirus vector. 

20. The method of claim 19 wherein the recombinant adenovirus vector is 23 1- 10. 

21. The method of claim 17 wherein the treating step comprises administering the 
RID complex to the leukocytes. 

22. The method of claim 21 wherein the RID complex is administered with a carrier 
which facilitates delivery of the RID complex into the leukocytes. 

23. A composition comprising a Receptor Internalization and Degradation (RID) 
complex and a earner suitable for facilitatirig delivery of the RID complex into a cell. 

24. A recombinant adenovirus comprising a polynucleotide encoding a Receptor 
Internalization and Degradation (RID) complex operably linked to a promoter, wherein the 
adenovirus is replication defective and wherein the polynucleotide is expressed upon infection * 
of a eukaryotic cell with the adenovirus. 

25. The recombinant adenovirus vector of claim 24 consisting of 231-10. 
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Mock -93% 

231-10 (E3+ vector) 24 hr. p.i. - 35% 
231-10 (E3+ vector) 48hr. p.i. - 11% 

Figure 20 
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Figure 24 
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Figure 25 
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Figure 26 
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caccaccaacaacacaccccaccccggaccgaagccaacacgacaacgagggggcggagc 

1 ♦ ♦ ♦ 60 

gcagcagcwaccacacggaacaaaacccaaccccggccacaccaccacccccccacccca 

Ecil 
1 

::igcgacgcggcgcggggcgcgggaacggggcgggcgacgcagcagcgcggcggaagcgc 

61 ♦ ♦ ♦ * ♦ 120 

aacaccgcaccgcgccccgcacccccgccccgcccaccgcaccaccacaccgcccccaca 

Afllll 
BspLUllI 
Ecil INspI 
111 

gacgccgcaagcgcggcggaacacacgcaagcgacggacgcggcaaaagcgacgcccccg 

121 ♦ ♦ — 180 

ccacaacgcccacaccgccccgcgcacacccgccgcccacaccgcccccaccgcaaaaac 

BsrGI 
3srFI I 

SgrAI TatI Ecll 
I I t 
gcgcgcgccggcgcacacaggaagtgacaacccccgcgcggccctaggcggacgctgLag 

131 — — — ♦ ♦ 240 

cacacgcggccacacgcgccccccaccgccaaaagcgcgceaaaacccgcccacaacacc 

Hael 
Msci 

Apol EaeZ 1 

It 

caaactcgggcgcaaccgagcaagacccggccatccccgcgggaaaaccgaacaagagga 

241 ♦ • — ♦ — ^00 

acccaaacccgcaccggcccaccccaaaccggcaaaagcgcecccccgacccaccccccc 

Apol 

Hindlll EcoRI 
Clal 1 EcoRV I 
t i II 

agcgaaacccgaacaaccccgcgccacccacagcgegcaacaccgacaagcccgacaccg 

301 ♦ 4. ♦ ♦ ♦ ♦ 360 

ccaccccagacccactaaaacacaacgagcatcgcgcaccacagccactcgaaccacagc 

Bcivi 

P3CI i 

SfCl I I EC057I BssHIZ 

til 1 I 

aacccccgcagccccacggacaeacggggccgaaggcacccccagacggccccgcgcgcc 

361 • ♦ ♦ ♦ 420 

c caaggacgc cggga tacccacgcgc cccaac c cccacagaagcccgccagaacgcgcga 

Taqll 

I 

ccacccgcaacaacacgaagacagcgggcgcggatggacaggaacaggaggaaaccgaca 

421 ♦ ♦ * 480 

agtagacgtcgocgcacccccaccacccacgcccacccgcccccgccccccttcgactgc 

xcml Acelll •* 

I \ 
ccccacccagaccgcggagaaagcccgcagecaggaggaagecgcaacaccagagccggg 

481 ♦ ♦ • ♦ * 540 

aaggcaaatccaacacctcccccaaacgccggtccccccccgacgccacggcctcgaccc 

BseRX BspGX Apol 

1 I I 

aggagggcaaggaggcgccgccgaacaaaccggacagaaacccgctaaccgaccccaagc 

541 ♦ . • . ♦ ♦ 600 

cccccccgcccccccacgacgacLcacttgacccgtcctcaaacgactgaccaaaactca 

Bg^?^^ Figure 2 8A 

B9CYZ I KgiEII 
\ I I 
aagcgacgctccactacccctccccaccagccaaagggaacaagacccccgagaccgcac 

601 — ♦ ♦ ♦ ♦ * 660 

cccaccacgaaacaacaaaaaaaaacaaccaatcccccccaccccagaaaccccggcgcg 
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PspSlI 
Sse8647X 

BacYI I i Bsgll Bsml 
1 I I M I 

agggccccaacaagggtgcagagacccccaggcccccgaeaaggcgagcgaacgcagccc 

66X ♦ ♦ ♦ ♦ ♦ ♦ 720 

ccecagaaccacccccacgcctccaggagcccaggaaccgccccacceacccacgccgga 

ccggcccccaccgagcgccgagccacggcaacgggcccttcceecaccacgaecaccaac 

721 ♦ ♦ ♦ ♦ ♦ ♦ 780 

agccaaagacggcccaegacccaacaccaccacccgaaaagagggcggcaccggcggcca 

BpulOI BsaWI P£IX106I 
I I I 
ccccgacgcccggccggcaacccgcagccaaggcggcgcecggtggcaccaccgccgcag 
731 ♦ ♦ ♦ 840 

aagaccgcgaaccaaccgccgaacaccgaccccgccacaggccaccacaacgacagcacc 

Hindi 

Hael Hpal 
t t 
gcgaccccggcccgccccaccagacaaaagacaccccccccgcaccggtgcaagccaacc 

841 * ♦ ♦ ♦ ♦ ♦ 900 

caccgaaaccggacgaaa cggcc cgc cc c ccacggggaaaacgcgaccacgcccaa c egg 

MspAXI 

Banll BslEI 
BsiHKAZ Sael 1 

B3P1286I Sagi I 

PflHI Sad GdiZX I 

It II 
acgccccggageccccgacccacgcgccgccgcccggccgecgccccgcgcccccccagc 

901 — ♦ 950 

cacagaaccccgagaaccaagcacgcgacaacgagceggcgacgggacgcagaaagaccg 

Kaell BglZZ 
AlwNI I asCYI 
3SCAPII xhal I BaeRI 

11 lit 
aggcgccgccccgcaacaaccccgcccacccccagacccagggcgccagccacccccccc 

961 ♦ ♦ ♦ ♦ ♦ ♦ 1020 

c ccgcga cgaga ca c ca c c aaggcaggcaaagac ccaga cccca eagccagcagaggagg 

ngccagaccaaagtagccgaccccagcgggggcgggagaagcggggcgaggccgatcggc 

1021 ♦ ♦ ♦ ♦ ♦ ♦ 1080 

acaa cccaac c c ca ccgaccaaagccaceeccaeecc cc c caccccgcccegactaaccg 

BsrFt 

NgoAlv BC3I Taqll 

I I 1 

cgggacaaagccgccggcaacaacccgccgcagcggaagcacagcgggcgeggggaaagc 

1081 ♦ 1140 

accccgccccggcggccgccgccgaacaacgccacccccgcacegeccgcgcccccccca 

Ordll Bsp24X Bsp24I 

I I I 

wgggcggcccacggcacctacccgcccccagccaacgccaaggcagggacacacagccag 

1141 ♦ ♦ ♦ ♦ ♦ 1200 

acccaccaiigcaccgtagacaagcaaaggccggccacagccceacccecacacaccgace 

PSCX 

sfd I 
I I 

ggctaagaeggcaccgcagaacaccacaacagagacgaccgcacacaaceaggccccgga 

1201 • * ♦ ♦ ♦ 1260 

ccgaccccaccacgacgccctgcggcaccgcccccaccaaegcacaceggcccgaagccr 

SspX 

I 

aagaccgccccccccactgcagcaacccggaacaccccacacacgagcgaacccgcacga 

X261 1320 

ccccagcgaaaaaagcaacaccgccgaaccccacaaggcacacgcccacccagacgcacc 

cacacgcccccgaggcccggaggccggggaacaaaacgcagacagggtgcaaacaaccag 

1321 — - ♦ ♦ • 1380 

acacacagaaaccccgaacccccagccccccgecccgcgcccaccccacgcccaecagcc 

Apol 

Apol EcoRZ Sfd 

I I I 

cagaaaagccacagcaaaccccacaaccaaagaactccgagaagaccagecacagccccg 

1381 ♦ ♦ * ♦ ♦ 1440 

gcccccccagcgccacccaaagcaccaacccctcaagacccccccagccgacaccaggae . 

Banr"^ lllV: B«r: Figure 28B 

II M I 
cccccgcaccgcggacggcgcccgaggcacgcaacgcgcacacaaacecagceaacgaac 
X44X - XSOO 
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agagacacaaegcccaccacggaccccacgcgctacgcgcgcgcccgggccagccocccg 

Psci 

Sfcl sfcX I BspMI 
I II I 
cgaacgaaggcgacgaccacagcgacgaggccgcagacgaggacaagggtgacaaacccg 
1501 ♦ — - ♦ ^ 1550 

acecacccccgccaccgacgccaccgccccgacgcccaeccecacccccaccgcccaggc 

Eael 
Gdili 
Sac 1 1 

MSPAIII 
BsiBI ( ( 
BSCOSI I t 
I I I 

caaagcaggcaaaccgcgaaaggcgggacgcaatctactccgacgtgagcgaccgcggcc 

^561 - ♦ 1- ♦ ♦ 1620 

acctcgcccactcgacaccccccaccccacgccagacgaagccacacccgccggcgccgg 

Aval 

BslHXAI Smll 
BSPX286Z Xhol 

I i 

aacgcagagcacgcacagaaaagcgcaaca&gggccaacaacacaagaacccgaggaacc 
1621 ♦ ♦ ♦ ♦ ♦ ^ 1680 

ccacacctcgcgcgcgcccccccgcgttgcccccagtcaccacacscccgagctcctcag 

Earl Apot 
I I 
acgccccacccaaccacaccgcaaaagaagagaacacggccccagaccgcccaacccacg 

♦ ♦ ♦ ♦ ? 1740 

cacagagcaaaccagcacgacaccccccccccccgcaccaaagcccggcaggccagacac 

Sfcl 

RleAZ BslHKAX Hael I HaoIV 

Xnnl 3SP1286I Scul 1 Hln4I 

i I I I I 

aaccccttcactgcgcgggctgagcacaacgacaggcctacagacggggggtctggcgcg 

l'^*^ ♦ ♦ * 1800 

ccaaaaaagcaacacacccaaccegcgccaccacecggacaeccaccccccagaccgegc 

B3p24X 

HaoZI I 
Tchllllt 1 
EC047IXI I I 

I \ \ 

tccgcgccccaggeaacaaacaagccacacaacaacaaggcaaacaaaeacaagcgccac 

i80I ♦ ♦ ♦ 1360 

agacgcgaaacccgccgcccacteggegcaccaccaccccgcccgcccgcacccgcgaca 

3sp24X 

"1111' BmrX I Apol 

> II I 

ggaaaaccaccacaagcccaagcccgceeagccaecgacaaaggcacgaacccggggcaa 

13S1 ♦ ♦ ♦ 1920 

ccccccggcggcgcccaggcccgagcgggccagcaaccgcceccgcacccgaaccccaec 

RUAX 

Bsawx BCSX BsbX I 

I III 

acccagggcagacgccagcccggtagcagtggcgccgcgacagtccgccgcgggcgcgac 

1921 ♦ ♦ ♦ ♦ 1980 

caaaccccgcccacaaccaggccacegccaceacaacgccateaggcaacacccgcgcca 

Hindi EcoSTI 
BsrFI I BpmX Bell Apol I 3CSl 

t 1 I I I t I 

ggtcgagccggccaaccccggagcaggcaagccgaagccgggcccgaccaaacccgcagc 

1^81 ^ — - ♦ 2040 

ccaacccggccagccgagaccccgcccgttcgactccgacccaaaccagcccaaacgtca 

HaeXX HaeXI 
AiwNXt Bsgl I BspGI RleAI 

II til I 

9caggcgccggcagaaaccaggcgccaacgcccaggaaagcccgacccgaaggccgcggg 

20^1 - ^ 2X00 

cgcccgcgaccgcccccagcccgcgaccgcaggcccctccaaaccaaacccccaacaccc 

BspCI 
BpmX RleAX I 

I I I 

tacaacctcgcccgcctggagcacaccccacacagagcaaaccgcccaggggaatacaag 

2101 - 2160 

acactagaacgggcggaccccgcacagggcgcaccccacccaacaggcccccccacgctc 



Tchiuii Figure 28C 

ccccccaccaacaaaaccgcgcccgcccccgcace 
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gcccgccccccagccccgcaaaagaaaagcagccaccccgacgcagacgaaaacacddac 

Haell 

Eco47rii I BsrBl Bani 
II I I 
agacaaagcaaggcacacaccaaagcaagcgccgcaacaagcagagcggcggaacaaaag 
2221 ♦ ♦ ♦ ♦ 2280 

cccacttcactccacgcacggccccgcccgcgacatcacccgcctcgccaccccgccccc 

MSII 
BsrGI i 

RleAI MslI Tchlllll Taci Taci I 

II II II 

gcgccagtgccccccaaacactcctgcgggggccacaacccgcaccgcccgcccacgcac 

2281 - — ♦ * * ^ 2340 

cacggccacaagagacccgcgaaaacacccccggcgccgaacatgacaaacgagtacacg 

AqoZ 

I 

acggcaacaccgcacaccccacaaaacggaaacccacacacaaaagccctacgaccccca 
2341 ^ ^ ^ ^ ^ ^ 2400 

caccaccacagcgtgcaaagcaccccaccctcaaacacgcacccccaaaacgccaaaagc 

Scyl Bbsl DrdI 

1 I I : 

ccctggaagaccgcgacaccacagccgccagcgccacccggccgccaaacagcacacaca 

2401 ♦ ♦ 2460 

ggaacctcccgacaccgcaacaccagcaaccacagcggaccgacggctcacegcacacgc 

KlndZXI 

I 

gcacacccgccaacctcgcccccgcggcgaacaacaagccccccacgccccgcggcgcat 

2*61 - ♦ ♦ ♦ 2S20 

cgtatgaacggccaaaacagaaacacc^cccaccactcgaaaagcaca^acaccacgca 

oral a 
Bsmi swal " BsrOI -m DrdI I 

I I • I I 

cccacaagagcagcgcacccactagcccccgacccaaacgcaacaccgcaagccggcccc 

2521 - ♦ - * -r* 2530 

aaacacccccaccacgcaagcaaccgaagaccaaacccacaccgcaac^cccgaccaagg 

Kaell 
Sfcl 

Eeo47lll iPsci Bgll 
1 I 1 I 
ccaaacccaacccccccggcagcgccgcagaccgccgcaagggegagcaagcccaaaacc 

2581 ♦ ♦ - * 2S40 

aacccgagccggaaaaaccgccgcgacgcccgacggcgcccccgcccgcccggaccccag 

Oral Bael 
I I 

acgcaccccaccccggacgccgcccccagcgcccaaaaagccgacaacaggcacaaacgc 

2641 * ♦ ♦ ♦ ♦ 2700 

cacacggagcagaacccacaacgggggccgcaaaccccccgaccgccacccacg^cigca 

asgl 
35U36I I 
Bael I 1 MS II 

... I 11 I 

gcgcgcagcaggcggcaaccccaaggcacagaagcgccagcacaagaacaaacagaaLca 

2701 - * ♦ 2760 

cgcacgccgcccgccgccgggaccccgcgcccccacgaccacaccctcacctgccccaac 

Hindi I I 
\ 

caagagcaaggacaaccccgaccccaaccccagaaaaaccagacaagcccgcagagccac 
2761 ♦ ♦ ^ ^ 2320 

gcccccacccccaccggggccggggccaaggcccccccaatccgtccgaacaccLcaatg 
Pad 

ccgaactgcccacacacccaaccaaaaaaccccagcaccccgcaaaacgcccccccgacc 

2821 ---- ♦ - . — 2880 

aacccaacgagcacacgaaccaaccttc^agggccgcggggcgccctacgaaaaaaccgg 

Banll 
BslHKAI 

3SP1296I „ . ^ 

Hin4i Fxgure 28D 

Hin4I sad Bpll Bpll 
I.I 11 
cgagccccgggagccgagcccacctcccgccctggaaaaacgggagcaacgcczggccac 
2381 ^ ^ ^ 2940 

. acccaaggcccccaacccgagcggaggacaaaaccc^cccacccLcaccacagaccaacg 

SCd Bsawi 

RleAI I BsrFI 3SPMI 

Bpuior I I • PlnAI Sum i Aarl 

111 I ! t I 
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gcccaggccgcaggcgcgggcgcagcaaccggcgacgcacccgcacgcccccgycdsjgcg 

* * 30C0 

cgagcccgacacccacacccgcgccgccggccaccgcgcgagcacgcaagggccgcccac 



31 



BseRI 
I 

aggagggcggtggcggtggtgccccccctgacggcgcagccgaagccgagaaggccgcgc 
cccccccaccaccaccaccacaaaaagaaccgccaca ccaaccccggcccc cccaacaca 

Earl 

Bsmaz Sapl 
I I 
ggcaaacccactccgccccgccggaaaccgccgtaaaccacaaacgaagagccgccaaag 

ccgcccgaacgaagcagagcgacccccgacaacacCwaacgcccacctcccggcaacccc 
HgiEII 

SexAI I asawi BsaXI ' Taqll 

' • lit 
caccaggcaaggcacctaccggcccgcccgcgcaaaccggaggcgaggtccgccccggcc 



3001 



3061 



3060 



312-: 



3182 



acggcccaccccacgaacaaccgggcgaacacgctcggcccccaccccaaacgaaaccag 

Bmrl 
Banll I 
BsaXI I 

I 1 

tgccccgggcgggcaaaaacggcggcgctcacaggacggcgacaggagccccagcagacc 
acgaaacccacccacccccgccaccgcaagcgccccaccgczgcccccggggccacccaa 



3130 



3240 



Bgiri 

MSII BSCYI 
t I 

ccaacccccgtacccaccacacccagcacagagacgacaacaaagaccccgacgcaaccc 
3241 . ♦ * ^ ^ 330Q 

gaccaaagaeacaaacaacacgagccgcgcccccaccgccgcccccagaaccacaccagg 

ECOO109I 
PspSlI 

BscOSX SanOI BsrBI BsrBI 

! Ill 
agggccaggacagccgcaaaccacggzcagaacacagggaccccgcccccgccccaccag 



3301 



3361 



3360 

ccccaaccccgtcaacgctcggtgccagccccgcgtccccggggcgagggcgaggcgacc 

SsaAI 

Haell AflllllTaqir 
I It I 

cagggggcgcccggcaaacccccgaatcaggccacgcgcaagccctacccgggcggcgag 

♦ * ♦ ♦ ♦ 3420 

gccccccgcgaaccacccgagggcccagcccgacgcacacccgagacggacccaccaccc 



3481 



Apal 
Ban 1 1 
Bspl286I 
Bmgl t 

aseSI I Earl 
BsaKX ECOO109II I Sapl 

I 111 t 

ccggacgccgcgcgccgggcecccgacacgccccccgggcaacccaaagcaacaaaaccc 
^*21 ♦ • ♦ ^ 3480 

ggcccgcggcacgcggcccgggagccacacgagaagcccgccaagccccaccgccccgag 

Sacll ■ 
MspAlII 

Bsawi Bscbsi I I Bcsl 

I I n I 

accggagccgcgggcaaagcacctgcggcggcggcagcggccgaggcgcgccaggcgcag 

" ♦ 3540 

cggccccggcgcccgccccgtgaacaccgccgccgccaccagccccacacagcccgcgcc 

PflllOSi Ecll 

I \ 

ccgccccgcccccccaccggccacccagccgcagccgcccgccgagtcccccaccgcgtc 

3541 — - 3600 

agcgagacggagaggcgaccagcaagccagcaccggcaggcggcccagaaagtggcgcag 

c^i!i B«pci Figure 28E 

I I . 
, ***9Ccgggaacaaact:ggcccgggcagcggccgggaggcccagaaaaggggccgaagca 
3601 * - 3660 

ccccaaccectacccgaceaggcccaccaccggccccccaggccccccccccaaccccac 

Bsawi 

BseRI BspEI Hin4I 
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aaccgaa?gcac9aaccccccaacaaacC9c:a9agccccaac?cccccggag^v*^^>J^cc 

3661 ♦ * ♦ 3720 

ccggcccccgcgcccgaggagccacccaacacctcaaggccacggaggccccgcgccgag 

Psci Eael 
BplI I Earl GdiXI 

TChllll secl iKinai ' Sapl BsrBIl 

I 1 I I I II 

cgaggacgaggcecgcagagtcaggaccgcecgacggggcgcaaacgaagagcggccagc 

3721 ♦ ♦ * • ♦ ♦ 3780 

gcccecgccccagacgccccaaccceagcggaccgeeccgcactcaetccccgccggteg 

BseRI 
Acelll 
BanIZ 

Bsal BsiHKAI 
Ssawi I 3SP1286I 
Haell BspEI I Hin4I Sad 

I i i 1 i 

gccgccgacccgaaacgccccgtccggacggagaccaagagaggagcccaccgacccgtc 

3781 ♦ ♦ - * 3840 

cggcggccagactctacagggcaggcccgcccccggccczctccccgagtggccgagcag 

Stnal BamHI 

BplI Aval I BscVI 

t lit 
gccgagccgaacaccccgccccccgaccctcaggcgagccacaccccgcccgggggGCAT 

3841 . — • ♦ ♦ 3900 

caacccgacctacggagcgggagaccaaaagcceacccaacacgggacgggcccccCCTA 

Aflll 
Smll 
KindZIZ I 
Bani I ; 

BanIX I I t Nhel 

BsiHKAI I I i Bsp24I I 

Bspl286I i I t oral i I Stcl 

Hin4X Saci IKpnl \ I PmeX t 1 Baal I Bsp24I 

I I I I I I I I : II I 

CCGACCTCGGTACCAACCTTAAGTTTAAACGCTAGCCACCTTCGGTCTCCCTATAGTGAG 

3901 * ♦ * ♦ 3960 

GGCTCGAGCCATGGTTCCAATTCAAATTTGCGATCGGTCGAACCCAGAGGGATATCACTC 

^Banll 
BsiHKAI 
Bspl286I 

vspl Bcsl SacI 

t I 1 I 

TCGTATTAATTTCGATAAGCCAGTAAGCAGTGOGTTCTCTACTTAGCCAOACACCTCTOC 

3961 ♦ -i- ♦ ♦ ♦ — 4020 

AGCATAATTAAACCTATTCGGTCATTC G TCACCCAAGACATCAATCG G TCTC7CGAGACG 

ecli 
I 

TTATATAGACCTCCCACCGTACACGCCTACCGCCCATTTGCGTCAATGGGGCGGAGTTOT 

4021 — ♦ ♦ ♦ ♦ 4080 

AATATATCTCGAGGGTGGCATGTGCGGATGGCGGGTAAACGCAGTrACCCCCCCTCAACA 

Aatll 

Ban I BsaKi I 

... I i I 

TACGACATTTTGGAAAGTCCCGTTGATTTTCGTGCCAAAACAAACTCCCATTCACGTCAA 

4081 ♦ * ♦ ♦ ♦ 4140 

ATCCTGTAAAAC LTriX, AGGGCAACTAAAACCACG G ' iUnnX^I ' t " lU ACGGTAACTGCACTT 

Taci 

I 

TGGGGTCGACACTTCCAAATCCCCGTGAGTCAAACCGCTATCCACGCCCATTGATGTACT 

4141 ♦ ♦ ♦ 4200 

ACCCCAC C T C TGAACCTTTAGGGqCACTCACTTTGGCCATAGCTGCQGGTAACTACATC 

BSCDSI 
NCOl 

Scyl BsaAI 
Mslll SnaBX Taci 

II 11 
GCCAAAACCGCATCACCATGGTAATAGCGATGACTAATACGTAGATGTACTGCCAAGTAG 

4201 * ♦ * ♦ 4260 

CGGTTTTGGCGTAGTGGTACCATTATCGCTACTGATTATCCATCTACATGACGGTTCATC 

Bgii Fiaure 2 8F 

Taci Bmrl Bsp241 I BsaHI ^ ^ 

I i II I 

GAJU^GTCCCATAACGTCATCTACTCGCCATAATCCCAGGCGGGCCATTTACCGTCATTCA 

4261 ♦ ♦ ♦ ♦ * 4320 

CTTTCAGGGTATTCCAGTACATGACCCGTATTACGCTCCGCCCGGTAAATCGCAGTAACT 

Bsp241 

AacIZ I MdeZ TatZ BglZ 

II 11 \ 
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CGTCAATAGGGGCCCTACTTTCCATATGATACACTTGATGTACTCCCAAGTGCGCAGTTT 

4321 • ♦ ^ 4380 

GCACTTATCCCCCGCATCAACCGTATACTA7CTCAACTACATCACGCTTCACCCGTCAAA 

Taqll 
Aacll I 
BsaHI I I 
I I i 

ACCGTAAATAGTCCACCCATTCACCTCAATGGAAACTCCCTATTCGCCTTACTATGGCAA 

4381 - ♦ 4440 

TGGCATTTATCAGGTGGGTAACTGCAGTTACCTTTCAOGGATAACCCCAATGATACCCTT 

Adtll 

BsaHI I Bgll 

t I I 
CATACGTCATTATrCACGTCAATGGGCGGGGGTCGTTGGCCCGTCAGCCAGGCGGGCCAT 

4441 ♦ ^ 4S00 

GTATGCAGTAATAACTGCAGTTACCCGCCCCCAGCAACCCGCCAGTCGGTCCCCCCOGTA 

TTACCGTAAGTTATGTAACCCGCAACTCCATATATCGCCTATGAACTAATGACCCCGTAA 

4S01 ♦ * • 4S60 

AATGGCATTCAATACATTGCGCCTTCAGGTA7ATACCCGATACTTGATTACTGGOGCATT 

Afllll 
Mlul 

vspl spel . Hindi 1 

II II 
TTGATTACTATTAATA^CTAGTCAATAATCAATGTCAJkCGCGTATATCTGGCCCCTACAT 

4561 - - * ♦ * 4620 

AACTAATGATAATTATTGATCAGTTATTAGTTACAGTTGCGCATATAGACCGGGCATGTA 

Hindi 
Bsp24l Acdl 

Nrul BpulOI Hindu: Sallii 

I I I t 1 1 

CGCGAAGCAGCGCAAAACGCCTAACCCTAAGCAGATTCTTCATGCAATTcaagcCCQCCg 

4621 ♦ ♦ ♦ — ♦ 4680 

GCGCTTCGTCGCCTTTTGCGG ATTOSGATTCGTCTAAGAACTACCrrTAAgc CCgaacagc 

Bsp24I 

Bgiii Ann I 

BSCYI Sfflll I 

I II 
acagaccccgggcgcggcctaagggcgggaaagaacacacaaggcgggggccccacgcag 

4681 ♦ — ♦ ♦ ♦ — ♦ 4740 

cgcccagaacccgcaccgaacccccacccccceccacacaccecaccceeagaacacacc 

BsiKKAI 
39P1286I 

I 

ccccgcacctgccccgcagcagccgecgcegccacgagcaccaactcgcccgacggaagc 

4741 ♦ — ♦ — ♦ 4800 

aaaacacagaeaaaacgccgccggcggcggcggcacccgcggccgagcaaaccacccecg 

BscOSI 

Banll Neol 
BsiHKAI SCyl 
Bspl286l Nspl t 

Sad. . . sphi t Bpml 

f * II f 

accgcgagcccacactcgacaacgcgcacgcccccacgggccggggcgcgccagaacgcg 

4801 ♦ 4, ♦ ♦ 4860 

caacacccgagcacaaaccgccgcgegtacgggggcacceggceceacgcagccccacac 

Pflll08I 

Banll Bsal I 

Bspl286l Bsp24II I 

I II I 

&c9g9ccccagcaccgacggccgccccgccccgcccgcaaaccecaccaccccgacccac 

4861 ♦ ♦ ♦ ♦ ♦ ♦ 4920 

caeecgaggccgcaaecaccagcggggcaggacgggcgcccgagacgacggaaccggacg 

PstI 

Tthllll EC057II S£d 

Hin4It secl tl AlwNII 

Mmel M Bsp24I I M EciX KspAlIIPscI 

III I I II t III 

gagaccgcgcccggaacgccgccggagaccgcagcccccgccgccgccccagccgccgca 

4921 ♦ ♦ • ♦ ♦ 4980 

ccccggcacagaccccgeggeaaeccccgacgceggaggeggcggegaagccggcgacge 

SacII Banll • 

HspAiii Bspi286i BscAPi FxQure 2 8G 

B3CDSI I 1 BpulOI I BCSl 

III It 1 

gceaecgeecgegggaccgcgaccgaccccgcccccccgagcccgcccgcaagcagcgca 

4981 — — — - ♦ S040 

cggcggcgggcgccccaacaccgaccgaaacgaaaggaccegggcgaacgcccgccacgt 

BsgX 
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Tchlllll Ecii I Hinci: y.uni 

III ! t 

gccccscgcccacccgcccgcgacgacaagccgacggcccccccggcacadccg^accc^ 

5041 5100 

cgaagggcaagcaggcgggcgccaccgcccaaccsccgagaaaaccgcgccaacccaaga 

Alwt4l 

Smal MspAlI BspMI 

Aval I Mmel PvuII BscYI I 

II I ! I I 

ccgacccgggaacccaacgccgcctcccagcagc^gtcggacccgcgccagcaggccccc 

5ltfl 5160 

aaccgggccc-cgaaccacagcaaagagccgccgacaacccagacgcggccgcccaaaga 

Tchlllll Xcml 

Bgll EC057I Dral asp241 I AlwNI I 

I I • 11(1 
gcccc>gaag?cccccccccctcrraacgcggcczaaaacacaaacaaaaaaccagacccc 

5161 5220 

cgggaCwCcczaaggaggggagggicacgccaaaCitigcacccactccccggcccgaga 

BssHII 

BsaB: Bsp24I Tchlllll BssKII I 

II i It 

gcccggacciggaccaagcaagcgccccgccgcccccaz.ccaggggccccgcgcgcgcgg 

5221 * * * * 5280 

caaacccaaacccagcccgcccacagaacgacagaaacaaacccccaaaacgcgcgegcc 

Ahdl 
MSP All 

Kaeivi 
Hin4Ii 

Taqll I I SCO0109I 
Smal : t t BslEI I 

Aval I 111 Bsal i PspSII PflMI 
I I I I I II I t 

caggcccgggaccagcggccccggccgccgagggzcccgcgcaccccccccaggacgcgg 

5281 — * - * ♦ 5340 

acccgggccccggccgccagagceagcaacccccaggacacacaaaaaaggccccgcacc 

BsmBZ 
I 

caaaggcgaccccggacgcccagacacacgggcacaagcccgcccccggggcggaggcag 

5341 * * ♦ 5400 

acccccaccgagacccacaagcccacgcacccgcacccgggcagagaccccacccccacc 

?scl 
Bcsl I 

Sfcl I Bsbl P211108I 

II I 1 
caccaccgcagagccccacgccgcggggcggcgczgcagacgacccagccgcagcaggag 

5401 * ♦ ♦ ♦ ♦ ♦ 5460 

gcggcgacgcctcgaagtacgacgccccaccacaacacccaccaggccagcaccgtcccc 

Kaell scyl 
EC047III I Ban: ECOO109I ! 

lit t I 

cgccgggcgcggcgcccaaaaacgccctccagcagcaagccgaccgccaggggcaggccc 

5461 - ♦ 5520 

gcgacccgcaccacggacccccacagaaagccaccgcccgaccaacggcccccgcccggg 

Taqll BsaAl 
I I 
ccggcgcaagcgcccacaaagcggccaagcc?ggacgggcgcacacgcggggacacgaga 

5521 - ♦ ♦ 5580 

aaccacacccacaaacgccccgccaacccgaccccacccacgcacgcacccccacacccc 

Hail 

cgcaccccggaccgcacccccaggccggccacgctcccagccacacccccccggggaccc 

5581 - ♦ — 5640 

acgcagaacccgacacaaaaacccaaccgacacaagggccggcacagggaggcccccaag 

Bcivi 
BslKKAI 
BSP1286I 
BseSI I 
ApaLI I t 
BsaWI I i 1 
Drdll Bsgl i III Apol 

I I I I I I 1 

acgccgcgcagaaccaccagcacagcgcacccggcgcacccgggaaacccgccacgcagc 

5641 — ^ S700 

cacaacacgccccggcggccgcgzcacacaggccacgcgaaccccccaaacagcacaccg 

BsmSI BsaKI BsnX Figure 2 8H 

I I t 

ccagaaggaaacgcgcggaagaacccggagacgcccccgcgacccccaagaccccccacg 

5701 — — ♦ ♦ 5760 

aacccccc^ccacgcacccccccgaacccccgcgggaacaccggaggccccaaaaggcac 
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Apal 
Banll 
BSPX286I 
BscDSI 
Bmgl I 

93CXI BseSI 1 

NslI MslI I BsrDI I 

I I i t I 

cacccgcccacaacgacggcaacgggcccacgggcggcggcccsggcgaaaacaicccc; 

S^fil ♦ ♦ 5820 

gcaagcaggtatcaccaccgctacccgggsgcccgccgccggacccgcccccacaaagac 

MS II Hael 
I I 
ggatcaccaacgccacagccgrgccccaggacgagaccgtcacaggccactcczacaaag 

5821 --- * 5830 

cccagcgaccgeagcaccaacacaaggccccaccccagcagtacccggcaaaaacgcccc 

Eel I Banl Drdri 

I i I 

cgcgggcggagggcgccagaccgcggcacaacggccccacccggcccaggggcgcagcca 
5881 ♦ ^ — 5940 

gcgcccgcctcccacggcccgacgccacaccaccaaggcaggccgggccccc^caccaac 

Acci 
I 

cccccacagacccgcatcccccacgccccgagcccagatggggggaccacgcstacccgc 
S941 — - ^ -.-w—-^,---—^*. 6000 

gggagcgcccaaacgcaaagggcgcgaaacccaagcccacccccccagcaeagacggacg 

aspMl 
MspAII I 

BspMI Pvull I 

I I I 

ggggcgacgaagaaaacggtccccggggcaggggagatcagccgggaagaaagcaggccc 

SOOl — - - 6060 

ccccgccaccccccccgccaaaggccccaccccCwCtagccgaccctccLCCcgcccaag 

Apal 
Banll 
BSP1286Z 

MspAli Bsrrz Bmgl I 

BpulOZ Pvull HglSIII BseSI I 

II 11 II 

ccgagcagccgcgacctaccgcagccggcgggcccgcaaaccacacccactaccgggcgc 

6061 * — ♦ ♦ 6120 

gacccgccgacgccgaacggcgccggccacccgggcacccagcgcggacaacggcccacg 

HspAlX 

PvuII 
Pstll 

Acelll sCcI II BpulOI 
1 III t 
aaccggcagccaagagagccgcagccgccgccacccccgagcaggggggccacctcgcca 
^121 ♦ 6180 

ctgaccaccaacccccccgacgccgacggcagcagggacccgceececcggcgaagcaac 

Nspl NspX ECU Hael I 

I ... I I I 

agcacgcccccgaceegcacgccccccccgaccaaacccgccagaaggcgcccgccgccc 
S181 * ♦ ♦ ♦ 6240 

ccgcacagggaccgagcgcacaaaagggaccggcccaggcggccccccgcgagcggcggg 

BscAPI BsaZ Ecu 
t I I 
agcgacagcagcccccgcaaggaageaaagccccccaacggcccgagaccgcccgccgca 
6241 ♦ ♦ 6300 

ccgccaccgceaagaacgcceccccgccccaaaaagccgccaaaccecggcaggcggcac 

RXeAI 

Nspl Taqli Acellll 

SphI Tthlllll BstEII AartM 

I I ! til 

ggcacgccctcgagcgcctgaccaagcagccccaggcggccccacagcccggccacccgc 

6301 ♦ 6360 

ccgcacgaaaacccgcaaactggcccgccaaggcccgccagggcgccgagccagcggacg 

BspMI BseRI 

I I 

cccacggcacctcgacccagcacacccccccgccccgcgggccggggcggcccccgccgc 

^361 ♦ ♦ 6420 

agacgccgcagagccaggccgcacagaggagcaaagcgcccaaceccgccgaaagcgaca 

3SPGI 

BSIHKAI I BSCDSI ECOO109I PlCTllir© 2 8T 

BSP1286I I Bcgl I Psp5II ^J-yu^JTe ^OX 
II II I 
acggcagcagecggcgcccgcccagacgggccagggccacgcctccccacgggcgeaggg 
6421 ♦ ♦ * ♦ . * 6480 
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cgccgccatcagccacgagcaggcccgcccgsccccagcacagaaaggcgccc^jw^uccc 

Kael 

MSCI 

Bcgl BssHII Eael I 

\ \ \ \ 

ccctcgccagcgcagcccgggccacggcgaaggggcgcgcccegggccgcgcgccggeca 

6481 ♦ ♦ * ♦ ♦ * 6540 

aggagcagccgcaccagacccagcgccacccccccacgcgaggcccgacgcgcgaceggc 

BsrFI 
Haell i 
Bbsll I 
EC047III1I 1 

Smll Bce83I Ml t Eco57l 

t I I I 1 I I 

gggcgcgctcgaggccggccccgccggcgccgaagcgccgccggcctccgccccgcgcgc 

5541, ♦ ♦ ♦ ♦ ♦ 6600 

cccacgcgaaccccgaccaggacgaccacgaccicgcgacggccagaagcgggacgcgca 

DrdI 

BscDSl I sacll 
Eael Ncol t MspMXl BssKII 

Cdill SCyl I BspGI BSCDSI I I Scyt t 

1 t I I in t I 

cggccaggcagcacccgaccacggcgccacagcccagcccctccgcggcgcggcccccgg 

6601 ♦ ♦ ^ ♦ ♦ 6660 

gccggcccaccgcaaaccggtaccacagcaccaggccggggaggcgccgcaccgggaacc 

BseRI 

BssSI I * 
Haell I ! 
BsaKI I f I 

Nan III ; 
scyl Banll ) I t Bcsl ^ 

1 I I t I i I 

cgcgcagcccgcccccggaggaggcgccgcacgaggggcagcgcagacccccgagggegc 

6661 ♦ ♦ — — — — - ♦ ♦ 6720 

gcgcgccgaacgggaaccccccccgcggcgcgccccecgccacgcccgaaaacccccgea 

Bsgl Hin4I EeoOt09t 

1 I t 

agagctcgggcgcgagaaacaccgaccccggggagcaggcacccgcgccgcaggccccgc 

6721 ♦ ♦ ♦ ♦ ♦ 6780 

ccccgaacecgcgccccccacggccaaggccccccacccgcaggcgcggcgcccggggcg 

Eael 

Gdill 
Banll I 
BsiHKAl I 
Bspl286I I 
Bsal Sad I 

BsmI BssSI Xcml 1 \ SexAI 

II 111 I 

agacggccccgcaccccacgagccaggcgagccccggccgtccggggccaaaaaccaggc 

6781 ♦ * * ♦ ♦ 6840 

cccgccagagcgcaaggcgcccggcccacccgagaccggcaagccccagcccccggccca 

... Hln4I 

BsrFI I 
I i 

cccccccacgccccccgacgcgccccccacccccggcccccacgagccggcgcccacgcc 
6841 — — --»• 6900 

aagggggcacgaaaaaccacgcaaagaacggagaccaaaggcacccggccacaggcgcga 

Bce83I 
BsrBIl 
Aval t I 

BSCZ17I Hael Smll 11 

Accll Smll Scul Xhol II 
I ( i I in 

cggtgacgaaaaggccgcccgcgcccccgcacacagacctgagaggcccgcccLcgagcg 

6901 - — ♦ * 6960 

gccaccgccctcccgacaggcacaggggcacacgcccgaaccccccggacaggagcccgc 

Sac II 
MspAlIi 
BSCOSI I I 

BseRI I il Hln4X BspGI 

M It I « 

gcgcccegcggccccccccgcacagaaacccggaccaccccgagacaaaggctcgegccc 

6961 — - 7020 

cacaaggcgccaggaggagcacacccccgagcccggcgagaccecgcccccgagcgcagg 

Hael Bsaxi BslEI - Bpml FlgUrS 28 J 

II It 
aggccagcacgaaggaggccaagcgggaggggcagcggccgccgcceaccagggggccca 

7021 - ♦ ♦ ♦ 7080 

cccggccgtgcttcccccgacccaccctccccaccgccagcaacaggcgatcccccaggc 
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7201 



7321 



7381 



7441 



7501 



7S61 



44/85 

Bbsl 



PshAl 
AfXilii 
BspLOllK 

I I 



Carl 



7081 f "^5?f5f^^*^**^*^*^^^*^^'^^^"^^^^^99C*^<^*^99**Wcgact^^ 

gagcgaggccccacactcccgcgtacagcgggagaagccgcagtcccccccaccaaccal 

BsaAI 
Pmll 

Hael ] EC057I 
' ' I 

acacccacacccggcgcaccggcccacaaggacttccccccgacaccctcccccaccccc 

PflMI 

MspAII I sea: 
Earl pvull I Taci i 

' till 
cgcg.ccgcccccacccccccccgcaccgccgcccgcgagggccagccgccggggcgagc 



7140 



7200 



gcgcaagcaggagcgagagaaggcgtagcgacagacgcccccggccgacaaccccaccca 
7261 ^fjff"fff****^'^^^^^*^^*<^^^^^^^'<=^**«*tt^tcagtccccaaaaacgagg 
cgagggagactcttcgcccgcaccgaagacgcgactccaacagccaaaggcccctgcccs 
SacZI 

MspAlli Eael 
BsqRI BstDSI II Gdill 
t I M I 

aggacccgatactcacccggcccgcggcgacgcccccgagggcggccgcacccacccggc 



7260 



7320 



ccccaaacca 



7380 



7440 



caagcggaccgggcgccaccacggaaacccccaccggcgcaggcagacca 

Hindtll MiMl 
» I 

gccccccctgctagaaaaacaacagtccgaaccaccgcccgccgggcacctcccgcaacc 

BssHIX Bsgi 
BsiEI ) Eael I 

Pvul I Gdilll 
Nrul I I scyi I 1 
111 I It 

acagcaacccggcgatggagcgcagggcccggctcccgccgcgaccggcgcgcccctcgg 

cgccgccgaaccgccaccccgcgccccaaaccaaaaacagcgctagccgcgcgaggaacs 

BsaAI BssKII 

I t 

ccgcgacgcccagctgcacgtacccgcgcgcaacgcaccgccacccgggaaagacggcgg 
ggcgccacaaaccgacgcgcacaagcgcgcgccgcgcggcggcaagccccccccgccacc 

BalHXAZ 
BSP1286X 
BseSI 
ApaLZ 
BSCAPI 
Oram 
BSPX286I 
SescAI 

Bmgl I 1 I I I SacIX 
BseSI I il I I MspAin Hindi 
Banll I It 1 I BSCDSI M TChllll | 

I I I II I I III I I 

cgcgcccgccgggcaccaggcgcacgcgccaaccgcggccgcgcagggcgacaaggtcaa 

♦ ^ ^ ^ ^ ^^20 

acgcgagcagcccgcggcccacgcgcgcggccggcgccaacacgccccactgccccagcc 



7S00 



7560 



7621 



7681 



BslEI 
Eael 
Eagi 

Kaell BspQI Nod 

' III 

cgccggcggccacccccccgcgtaggcgcccgctggcccagcagaggcggccgccctcgc 

^ ^ ..^ ^ ^ggQ 

gcgaccaccgacggagaggcgcacccgcgagcaaccaggtcgcccccgccggcgggaacg 

Bsmax BacOSX Figure 28K 

I I 
gcgagcagaacggcggtagggggcccagccgcgccccgtccggggggcccgcgtccacgg 
♦ ^ ^, ^ ^ — ^ 7740 

cgcccgccccaccgccaccccccagaccgacgcagagcaggccccccagacgcaggcgcc 
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Smal 

Aval I BssHIl 
I \ I 

caaagaccecgggcagcaggcgcgcgccgaa^cagcccaccccgcacccctgcaagccca 

7741 * - 7800 

acccccggggcccgcegcccgcgcgcagcczcaccagacagaacgcaggaacgcccagac 

BsCDSI 
SCOOI09Z 1 
3ssHi: PspSII Ncol 

Haell BssHll I SanDI Scyl 

I I i It 

gcgcccgccgccacgcgcgggcggcaagcgcgcgctcacacgggczgagcgggggacccc 

7801 ♦ * - ♦ ♦ 7860 

cgcggacgacggcacgcgcccgccgcccgcgcgcgagcacacccaacccaccccccgggg 

Pf LMI 

Taqll I Nspl Hin4r 

! I I ' 

acggcacggggcgggcgagcgcggaggcgcacacgccgcaaacgccgcaaacgcagaggg 

7861 ♦ ♦ ♦ ♦ ♦ ♦ 7920 

caccgtaccccacccacccgcgcccccgcacgcacggcgcccacagcacccgcacccccc 

Sac 1 1 

Banll MspAlII 
Bspl286I BplZ BSCOSI I i BssHII 

I I I I I I 

gctccccgagcaccccaagacacgcagggcagcaccccccaccgcggacgctggcgcgca 

7921 * - * 7980 

cgagagacccataaggccccacacaccccaccgcagaaggcggcgcccacgaccgcgcgc 

BsaAl BseRl Taqll 

I I I 

cgcaaccgcacagcccgcgcgagggagcgaggaggccgggaccgaggccgccacgggcgg 
7981 ♦ ♦ — . ♦ ♦ 8040 5 

gcactagcacaccaagjracgcccccccgcccccccagccccggccccaacgacgcccgcc 

Mmel Nspl 
Bbsl I Mmel i EC057I 

.11 II » 

gccgccccgcccggaagaccacccgcccgaagacggcacgcgagccggacgacacggccg 

8041 ♦ - ♦ ♦ 8100 

cgacgagacgagccctccgacagacggacctccaccgcacacccaacccaccacaccaac 

BsaX 

Bbsl BsaKI I 
I I I 

gacgctggaagacgctgaagccggcgcccgcgagacccaccgcgccacgcacgaaggagg 

aioi ♦ ^ • * ♦ 

ccgcgaccccccgcaacttcgaccgcagacaccccggacggcgcagcgcgcgccrccccc 

, Bag I Acelll aspMI 

• Hindi 1 BscEtl I DrdI I 

^ II 1111 

cgcagg^ccgcgcagcccgccgaccagcwcggcggcgacccgcacgcccagggcgcagc 

8161 ™ ♦ - 8220 

gcacccccagcgcgccgaacaaccggccgagccgccactggacgcgcagaccccgcgcca 

BspGI - 
I 

agcccagggtccccccgacgacgccacacccatcctgccccccctccccccacagcccgc 

8221 • ♦ ♦ ♦ 8280 

ccaggccccaaaggaaccaccacagcacgaacaggacagggaaaaaaaaggcgccgagcg 

Seal 

AceZZI - £arZ Taci I 

I I II 

ggctgaggacaaaccccccgcggcccccccagcacccccggaccggaaacccgccggccc 

8281 — ♦ ♦ 8340 

ccaacccccgcccgagaagcgccagaaaggtcacgagaacccagcccccgggcagccgga 

Nspl Hindi Bgll 

ccgaacggcaagagcccagcacgcagaaciggccgacggcccgg-aggcgcagcaccccc 

8341 - * 8400 

ggcccgccacccccggatcgtacaccccgaccaaccgccggaccacccgcgccgcaggga 

T'aqll . ^ _ 

RieAi I Figure 28L 

asawi I I ^ 
BglZ BspEZ 1 ! 

I . I I I 

c ttctacgggtagcgcgcatgcccgcgcggcctcccggagcgaggcgcgggcgagcgcaa 

8401 «. ♦ 8460 

aaagacgcccaccgcgcacacggacgcgcsggaaggccccgccccacacccactcgcgcc 

Ecil 
I 

aggcgtccccgaccacgaccttgaggcaccggcacttgaagccagcgccgccgcacccgc 
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8461 



8S21 



3581 



864: 



8701 



cccacagggaccggcactgaaaccccacgaccacaaacttcagccacagcagcgta^^cg 
cccgctcccagagcaaaaagcccgcgcgccicccggaacgcggacttggcagggcgaagg 
ggacgagggtcccgccccccaggcacgcgaaaaaccccgcgcccaaaccgccccgcctcc 

£coO109I 

Earl PspSri 
' I 

accgcagcaacctczcacagaaagggcgcgccccgtaccccaacgcacaccacgcccccc 

BslHKAI 

Bani Bspl286I 

' I 
gccccggcaccccggaacggctgccaaccacccgggcggcgagcacgaccccgccaaagc 

cagggccgcggagccccgccaacaaccaacggacccgccgcccgcgctagagcagcaccg 

RleAI 

cgccgacgccgcggcccacaacgcaaagtcccaagaagcgcgggacgcccccgacggaag 

gcaactacaacaccgggcgtcacactccaaggrcccccgcgccccacgggaaccaccccc 

BslHKAI 
3spl286I 
Banll BanlZ 
BslHKAI BSP1286I 
3spl286l BPU1102I 1 j 

Pfllioei Sad Earl I i i 

Eco57i I Aceiri I Sapi i i i 

r I I i 1 III 

gcaactctccaagcccctcgcaggcgagcccctcaggggagccgagcccgcgccccgaaa 

cgccaaaaaactcaaggagcacccacccgagaagccccctcgacccgggcacgagacccc 

Bmrl 
ApaXi 
Banlli 
Bspl286Il 

Mmel I I Banll 
Bmgllll BslHKAI 

- BSP1286I 
£coOl09I Mil Sacl 

I n II I . 

^^^^*^^^^*^^^"*^**^^*^^^'^^^**^^^*^9*^^9^9CT:ccacaggccacgggcca 

cccgggccagacgccccacccccaacccccgccgcccacccgaggcgcccagcgcccggc 



8520 



8580 



8640 



8700 



8760 



8761 



8820 



8880 



ECOO109I 

PspSII Hael 
5se8647I MscI 
BspMI Aarl Nrul i Eael I 

i til II 

^^*0^*ttC9caggtggtcgcgaaaggccccaaaccggcgacccacggccaccc:ctccg 

^ ^ ^ ^ ^ gg^Q 

aaccgcaaacgcccaccagcgccccccaggacccgaccgccggacaccggcaaaaaagac 

MspAll scyl 

I I 

gggcgacgcagcagaaggcaagcgggccctgcccccagcggtcccacccaaggcccgcgg 
8941 ^ ^ g^QQ 

cccaccacgccaccccccacccgcccagaacaagggccgccagggcaggccccaagcgcc 

Bsal Ecix Real Acelll 

I III 

9001 * ♦ ♦ ^ ...^ 9QgQ 

gacccagagcgcgccgtcagcgacccccgagcagaggcggcccgaagcaccggccgtact 

BSCAPI 
BSP1236I 1 
BssSIt t 

Smglll I Pfllioai 
Bsesm I ECOO109I Bsal i BsmBI 

HI ! I I ; I 

*^'^^*^'*'^^'^^^"^***^9*^^^"^*^^^**9cacaggcccccacaccgcaggcga 

^^^^ V ♦ ♦ ♦ --^ 9120 

ccccgcgcccgacgaagggcccccgggggcaggcccacacccagagacgcagcacccacc 



Bsiei BscYIi 
Pvul UbaLII MunX 

I II I 

caaagagacgcccggcgcgaggacgcgageegaccgggaagaaccggaccccccgceacc 
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9121 ♦ ^ 

gcccccccgcgagccacgcccccacgcccggccagcccccctcgacccagagggcggcgg 
PtlHl BseRi BssSI 

ccaacccccccaccgacaaccacaccacctccatccccagggacgccgcccggcccgcgl 

BsiHKAI 
BSP1286I 
Bsgll 

Seal BseSItl 
Taci I ApaLl Ml BsrGI 

Fspl ^1 I MspAlI I Ml Tad 

cgcgccggcccccgcaaaaacgcgcgcagcaccggcagcggtgcacgggccgcacacccc 



9180 



9240 



9241 



gcacgaccgaaaacacccctgcacgcgccacgaccgtcgccacgcgcccgacacgcagga 



9300 



BssSI 3anTT 
ECONI Hindi BstEI Apo: BSP1296I 

i \ \ II 



9301 



9360 



9361 



9420 



9480 



9S40 



cgcgccccaactggaccgccggcgcgcgtccccccgccccacccccaaacccggggagcg 

Aval 
Smll 

Bbsl Tthlllll Xhol 
> I I 

fj^55f555f "^'*^'^^^'^^'^^^^*=-**^^*^*^'^^^'^^^^^*=^^^5accgcccggccgcc 

gaccgcccaaaccgaccaccagaagacgaagccgacgaacaggaaccggcagaccgacga 

^ Banir MmttI 

Bspl286r BspGIl BssKII 
* till 
9421 ^?^????^^^^^^^^^^^^^^^^^^^^^^^^^^^^^9cgagGecA^Agzce^gAzgt.ceg 

gctccccccaacgccacccagcccggcggtgcggcgcgcccgggtcccaggtccacaggc 

BscOSI 

BalEI • Xcelll - ^tyl PflMI 

I • • I • 11 

94 81 J^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^'^^^^^^^^^^^^^c^gzcc&cggzczgga. 

gcgcgccgccagccccgaaccaccgwcgtagcgcgcccaccctcgacaggcaccagaccc 

BsaKI 
Sacll 
MspAlIl 

BSCDSI I I 1 sfcl 

Banii I M t Banll i 

BslKKAI I I I I BSIHKAI 1 

BSP1286I I II I BSP1286I IPscI 

Sad ill! Bpml BspMI Sad ISbCI 

I I II I I I III 

9S41 ?^^^^^^^'^^^^'^^^^^^^^^^^'3<3^9CZCczgcaggzctACCZcgcAZAg&cgggi^ca, 

cgagggcgccgcagcccagcccgcccccgaggacgcccaaacggagcgcacctgcccagt 

Xcml 
AlwNIl 

BszYi ?flMli BsaHI 

> M I 

cccgcgcccgacccaggcccaccacggaccaaaggcccccgaccaaccaccgccgcagcc 

Sac 1 1 
MspAlIl 
BSCOSI I I Kpnl 
Hin4I Baell M BanI i Bael 

I tin 11 [ 

accgaacgccccccggcgcaggggcgccgcgccgacgccacggcgcgccgcccgccaccc 

NstI _ . ^ ^ 

„^«fpAiii Haeiv I Banii Pigure 2 8N 

BSCDSI I I scyl Hln4I I Bspl286t 

• I i I I 1 I 

^<=^C9ggggcgcccccggacgatgcacccaaaagcggcgacgcgggcgagcccccggagg 



9600 



9660 



9720 



ggcgcecccacaggaacccaccacgcagactcccgccaccgcgcccgcccgggggccccc 

AC«III 
BSSHII 
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Banll Haell 
BsaWI Bspl28S: BsaHI I 

Bspl286l Bmgl I Narl 1 

BspEX BseSI I Banll I 

< I I I I I 

^ agggggggccccggacccgccgggagagggggcaggggcacgccgscgccgcgcgrggg 

^^^^ — ♦ - - ♦ 9840 

accccecccgaggcccgggcggcccccccccccgcccccgtgcagcegcggcgcgc?ccc 

AlwNI BssHII 
I t 

. . <=*9gagccggcgctgcgcgcgcaggccgccggcgaacgcgacgacgcggcggccgarrcc 

♦ * - 9900 

gcccccgaccacgacgcgcgcacccaacgaccgcccgcgccgccgcgccgccaaccagag 

Apal 
BanXX 

Haell Bspl286l 
BsaHI I Bmgl I 

Narl I BsdSI I 

Banll I Bbsl I I Smll 

III til I 

ccgaacccggcgcccccgcgcgaagacgacgggcccggcgagcccgagcccgaaagagag 

* * * 9960 

gaccwagaccgcggagacgcacctccgccgcccgggccacccgaactcggaccccccccc 

Bce83I Hindi Bsgl 

^ I I 

^^<=9^<^a9aaccaaccccggtgccgccgacggcggcctggcgcaaaaccccccgcacccc 

^561 ♦ 10020 

aagccgtcctagccaaagccacagcaaccgccgccggaccgcgtctcagaggacgtgcag 

HaelV Eael Bglll 
Bsmfll Hin4I GcJill Earl BscYI 

I II It 

ccccgagccgtcctgataggcgaccccggccatgaaccgcccgatcccccccccccggag 



10021 



aggacccaacagaaecacccgecagagccggcacccgacgagccagagaaggaggacccc 



10080 



BsCDSI 

Bpml Mmel Ace III 

I I i 
acccccgcgcccggcccgccccacggtggcggcgaggccgccggaaacgcgggecacgag 

- — ♦ - ♦ 10140 

cagaggcgcaggecgagcgaggcgccaccgccgccccagcaacccccacgcccggcaccc 

Hael Acci 
scul sfcl 1 
1 i I 
ccgcgagaaggcgccgaggccccccccgccccagacgcggccgcagaccacgcccccctc 
^0141 ♦ ♦ ^ ^ 1Q200 

gacgcccccecgcaaccccggagggagcaaggcccgcgccgacacccggcgcgggggaag 

asaAI 
?mll 
Banll I 
BslHKAI I 
Bspl286X I 

BssHII Aarl BspMI SacI I 

I I I I I 

^^201 ♦ ♦ — * ^ 10260 

ccgcagcgcccgcgcgtaccggcggacgcgccctaacccgaggcgcacggcccgccLctg 

HaeXX 

BbsX AlwNII 
I i I 

^^cgcagccccgcaggcgccgaaagaggcagccgagggcggcggcggcgcgtcccgccac 
^°261 ♦ — ♦ ♦ * ♦ 10320 

ccgcaccaaagcgcccgcgaccccctccaceaacccccaccaccgcsacacaagacggcg 

Smll 

EcoRV HaeX I 

TatI Bce83X i scyx StuX i 
I 1 I I I I 
^aagaagcacacaacccagcgtcgcaacgcggacccgctgacaccccccaaggccccaag 
^^'321 ♦ ^ ^ 10380 

cccctccacgcaccgggccgcagcgccgcacccaagcaaccacagggggtcccggagtcc 
BSCDSX 

Ncoi Figure 280 

Styx Pf 111081 

Haell I KaeX i Bscosi BmrX BssHXX 

I I 1 I I 1 I . 

«/tiA« ^cgccccacggccccgtagaagcccacggcgaagctgaaaaaccgggagccgcgcgccga 

10381 ♦ ♦ . ♦ 10440 

cgcgaggcaccggagcacccccaggcgccgccccaaccccccgacecccaacgcgcggcc 
Hindi 

Hpal Banll 
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BseRI I BsiHKAI 
Bpml i I SSPI286I 
aseRI I I BsaXZ Bbsl Sad Tthllll 

I I t I II 

cacggct.aacccctcccccagaagacggacgagcccggcgacac-gccgcgcacr^.cgcg 

10441 ♦ ♦ ♦ ♦ * ♦ IC500 

gcgccaaccgaggaggaggccccccgcccacLcgagccgccgzcacagcgcgcgsagcgc 

SCOO109Z BseRI Z=oOI09: 

SCcI I Earl r Sari : 

t ; II I : 

cccaaaggccacaggggcccccccccccccc^caaccczcrcccrsacaagggc;^cccc 

10501 * * ♦ - 10S60 

gagcccccgacgcccccggagaagaagaagaagccagasgagaaggcacccccggagggg 

cccttcncctcccggcggcggcgggggaggggggacacrgcggcgacgacggcgcaccgg 

10561 ♦ * 10520 

aagaagaagaagaccgccgccacccccccccccccgigccgccgccgccgccgc-cggcc 

Hindi * SadI 
Acdl MspAlU 
BslEIl Kaell BscDSI II 

Sail! IEC047IIX | asp24I I li 3sal 

III I ! I I II ' 

gaggcggccgacaaagcgcccgaccacccccccgcggcgacggcgcacggccccggcgac 

10621 ♦ * ♦ 10530 

ccccgccagccgccccgcgagccagcagaggggcgccgscgccgcgcaccagagccaccg 

BslEI 
Mfflel 
Eael 
EagI 

Gdill i Bbsl 
Bsp24Z t I BsaHI ! 

Ill t i 

ggcgcggccgtccccgcgggggcgcagccggaagacgccgcccgccacgccccggccacg 

10681 ♦ ♦ ♦ ♦ ♦ ♦ 10740 

ccgcgccggcaagagcgcccccgcgccaaceccccgcggcgggcagcacagggccaacac 

Bcivi Kaell t^sil Muni 

I til 
ggccggcggggggccgccacgcggcagggacacggcgccaacgacgcaccccaacaaccg 

10741 * ♦ 10800 

ccaaccgccccccgacggcacgccgcccctacgccgcgac^gccacgcagagcigctaac 

BpulOl 

EC00109I ! BslEI 
Ecil PspSri I Hin4: BsaWI 1 

I II ; M 

ccgcgcaggcaccccgccgccgagggacccgagcgagiccgcaccgaccggaccggaaaa 

10801 ♦ — ♦ ♦ ♦ ♦ 10360 

aacacacccacgaggcggcggccccccggacccgeccaggcgcagccggcccagccccrr 

BSCDSI 
BslKKAI i 

Aval BSP1286II 
Smll BsaHI Bsp24I i I 

Xhol Bsp24I BpullOZI 1 I I 

I 1 ; I 1 1 

ccccccgagaaa'ggcgcccaaccagccacagccgcaagg^aggc^gagcaccg:;ggcggg 

10861 — - - 10920 

ggagagccccccccgcagatcggccagcgccagcgciscacccgacccgcggcaccgccc 

MspAII BsiEZ Ecil 

I I I 

cggcagcgggcggcggccggggccgctcctggcggaggcgc^gccgacgacgcaacca^a 

10921 ♦ * 1Q980 

• gccgccgcccgccgccagccccaacaaagaccgcccccacgacgaccaccacaccaaccc 

Bcea3I 

Smll Hindi I 

BsmBI I Accll I BscXI 

BSP24II I Ecil Sail I I I Bsp24I SCyll 

I I I I I I I I I II 

gcaggcggccccgagacggcggacggccgacagaagcaccacgticcccgggcccggcccg 

10981 ♦ ♦ ♦ ♦ 11040 

cacccgccagaaccccgccgcccaccagctgcctccg-ggcacaggaacccaggccggac 

Eaol 

Bsrai Gdill Figure 28P 

Fspll BslEI I BspMI ^ 

Mil 

-ccgaacgcgcaggcggccggccacgccccaggccccgctccgacaccggcgcaggcccc: 

11041 ♦ ♦ ♦ - ♦ 11100 

gacccacgcgcccgccagccggcacggggcccgaagcaaaaccgcagccgcgcccagaaa 

BsrFI 

I 

gtagcagtcccgcacgagcccccccaccggcactcccccccccccccccccczgicccgc 
lllOl ♦ • - - * 11160 
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caccaccagaacgcactc?gaaagacggcc-gcgaagaagaagaggaaggagaa-ojgacg 

HaeXI 
3saHI 1 
£aeZ Narl i 

BsCAPI Ecil Gdill Sanll i £arl 

f II I I I I 

accccctgcacccaccgctgcggcggcggcggagcccggccgcajgcggcgccccctccc 



11161 



cagagaacgcagacagcgacgccgccgcsgccccaaaccggcawccaccgcgggagaagg 



11220 



, ,,,, -<=ccai^?cgtgcgaccccgaagccccccaccggccgaagcagggrcaggicggc?acaac 

^^^2- ** * - ---- ♦ 11230 

agggcacgcacaccggggccccggggagtagccgaccccgccccgacccagccgccgctg 

BspMI 

eccn: I Ai-.i: 

MslI ; 1 Haeivi 
Bsg: Hael Aarl I ■ lAccI Hin4:i 

I I t I I J I : I 

^'=5cccggctaacacggcctgccgcacccgcgcgagggcagaccggaagccacccacgtc 

^^^^^ ♦ ♦ 11340 

cgcgagccgattacaccggacgacgcggacgcacccccaccigacctccagcaggcacag 

Hael 

MSCI 

Bsbl Eael I 3sgl 

i ' \ I 

, , ^ , , cacaaagcggcggcacgcgcccgcgccgacggcgcaagcgcagccggccacaacggacca 

11400 

gigcc-cgccaccacacgcgggcacaaccaccacacccacgccaaccggcaccgcccggc 

BsmBI 

Ban 1 1 
BsiKKAI 

Hindi BSP1286I 
HpaX BscEII sad 
1 I I 

, , Stcaacggcccggcgacccggccgcgagagcccggtgcacccgagacgcgagcaagcccc 

- 11460 

caaccgccagaccaccgggccgacgccctcgagccacacggaccccgcgcccacicggga 

BsaAI AlwNI 
SnaBi SexAI PflMI Bcivi 

I II I 

, , cgagccaaacacgcagccgicgcaagcccgcaccaggcaccggtaccccaccaaaaagcg 

^1^^^ ♦ ♦ 11S20 

gcccagtccatgcaccagcaacgttcaggcgcggcccacgaccatagggcggcircccac 

Eaet Banll Bglll 

Gdiir 3spl296I BstYI 

i I i 

cggcggcggccggcggcagaggggccagcgcagggcggccggggctccgggggcgagacc 



Aval 
Snll 
Xhol 
I 



1152 



gccgccgccgaccgccatccccccggccgcaccccaccggccccgaggcccccgccccag 



11580 



BsrFl 

. - . EcoRV Mmel BspCl KslI NgoAIv 

II III 
ttccaacacaaggcgatgatatccgcagacgtacctggacatrraggcgacgccggcggc 

^^5®* ♦ 11640 

aaggccgcactccgccactataggcacccacacggacccgiaggiccaccacggccgccg 

MSPAII 

BasKlI Fspl I 3SCAPI 

1 lit 
, , -«t9«tggaggcgcgcggaaagccgcggacgcggtcccagacgtcgcgcagcggcaaaaa 

^^^41 ♦ 11700 

ccaccacccccgcgcgccciccagcgcccgcgccaaggcccacaacgcgccgccgccicc 

BscDSI 
Ncol 

. Scyl asrFl 3SSHII 

BslHKAII Eael I SSSHII i 

Bspl286It Gdlll 1 Bgi: I I Hindi Xbal 

II II III I : 

, , - _ , ^cgccccacggccgggacgccccggccggccaggcgcgcgcaaccgctgacgccccagac 

-^^^^ r 11760 

cacgaggcaccagccccgcgagaccggccagcccgcgcgcgcragcaactgcgagacctg 

.an . Figure 28Q 

BaaXI . BseSI IBs COS 1 1 Apol 

I I I II { 

, , ,^ cgcgcaaaaggagagcccgtaagcgggcaccctcccgcggcccggcggacaaacccgcaa 

^^^^1 - ♦ 11820 

gcacgcttcccccccggacacccgcccgtgagaaggcaccagaccacccacccaagcgcc 
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Scii 
SciVl 
BslEX 
Eae-I 1 
Banll EagI I 
Ecil BslEI BspI286l Cdill I 

II t t I 

gggcaccacggcggacgaccggggcccgagccccgcacccggcc^cccgccgcgacccac 

11821 ♦ — ♦ ♦ 11880 

cceacagcaccgcccgccggccccaagcccggggeacaggccggcaggcggcaccaggca 

AacIX BslHKAX 
BSCEIZ BsaKI I SSP1286I 

I III 
gcggccaccgcccgcgcgccgaacccaggcgcgcgacgccagacaacgggggagcgcccc 

11881 * ♦ ♦ ♦ ♦ ♦ 11940 

cgccaacggcgggcgcacagcccgggcccacacgccgcagcccgccgcccccccacgagg 

Hael 5SSHII 
MscI Eael I 
Nhel Eael IGdii: ! 

i I I i ! 

cctcggcccccccccaggcgcggcggccgccgcgctagccctttcggccaccggccgcgc 

11941 ♦ * ♦ 12000 

aaaaccgaaggaaggcccgcgccgccgacgacgcgaccgaaaaaaccggcgaccggcgcg 

Sfcl 

gcagcgcaagcggccaggccggaaagcgaaagcaceaagcggcccgccccccgcagccgg 

12001 ♦ ♦ ♦ - 12060 

cgccgcacccgccaacccgacccccegcccccgcaacceaccgagcgagggacaccggcc 

3sl£Z 
Eael 
Eagl 

ECO0109I Gdill 
PspSll BsrFI I 

Styt SanOI Alol Rsril I I 

t II I t t 

agggccaccccccaagggccgagccgcgggacccccggcccgagccccggaccggccgga 

12061 ♦ ♦ ♦ * ♦ ♦ 12120 

ccecaacaaaaggcccccaacccagcgecccgggggccaagcccagagcccggccggcct 

BsaWl 
Apel BapEI 
I I 

ccgcggegaacgggggcccgcccccccgccacgcaagaceccgcccgcaaaccccccegg 

12121 ♦ ♦ ♦ ♦ ♦ ♦ 12180 

gacgccgcccgcccccaaacggaggggcagcacgccccggggcgaacgcrcaaggaggcc 

as cap: 

Banll Ssawi i 

Bspl286I NSlXI I BseRI 

I Ml I 

aaacagggacgagceccccccccgccccccccagacgcacccggcgccgcggcagacgcg 

12181 ♦ ♦ 12240 

cccgcccccgeccggggaaaaaacgaaaagggcccacgcaggccacgacgcegcccacgc 

aspl286I 

. - * BsQRIt 

Bmgll 1 
BseSIl I 

BbvCI Banz I I I 

BpulOX MspAlI MspAll NspX lilt 

I t t I I I II 

cccccccccccagcagcggcaagagcaagagcagcggcagacacgcagggcacccccccc 

12241 ♦ ♦ ♦ 12300 

ggggggaggagccgccgccgctcccgcccccgccgccgcccgcacgcccegcgggagggg 

Hindi 
SacZX I 
MspAl Z I I 

EscDSt I I I 
III I 

tccccccaccgcgccaggaggggcgacacccgcggctgacgcggcagcagacggcgacca 

12301 ♦ ♦ ♦ 12360 

&99aggacggcgcagccecccccgccgcaggcgccaaccgcgccgccgcccaccaccaac 

ApaX 
BanXX 
BspI236X 
Bmgl 
BseSZ 
HaaZI 
BsaKX 



NarX 
Banll 
SacIII I 
MspAlXI I I 
BSCOSX 111! 



Figure 28R 



BaeRI 

BspGX SCOO109X i 
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I 1 I , . 

12361 

acccffwggcgccgcggccVaVgVcVcaVewacVegUcctcVt 

Banr o»„T 
Haexx B.„iof«7 .'kS rJlH -iJi:^ " 

12421 il'll'JllV^^^^^^^^ 

cgccgacccccgcgg3agaggaccVg;;;;;;VcVc;«;;;;;;;;;;;;;;;;;;;;; 

Sacll 
MspAlI I 
BSCOSI I I 
BsaAI III 

Suni I I I I _ , Banll 

^coS7i, ! !1 ,^„BsiHI . aspl286t 

12481 Al^l^All^'ll^^^^^ 

accccgcacgcacggcgccgccccggacaVaVc7ccggcgVtc7cc^^^ 
Haeiv 

Hln4I Alol 

BseRI I BseRI I Acelil u^^r « f ®^ 



12540 



12S4 1 . ^l'?"^,^"^^^^^^ 

ccacgccccagccctcaaggcgcgccccVcVcUglcgcw^ 



12600 



BaniZ 

sseRii BSSHII I 

12 60 1 

caacgacgcgccccccccgaaacccggVccVcgVgcVcgg^^^ 

BslBI 
Eael I 
CagX I 
Gdill I 
BsaAI II 

Afllll I Koci , SexAI , ordll 
12661 

cgcgcaccgccggcggccggaccaccggVgUc7cccgUlgcU« 

Sun I 

„. BsaAI I 

Pmll I BssHIl s'cl 

1272 1 

gaaagcccccccgaaaccgtcggVgVaVgUlgcgaaclc^^^ 
RleAI 

B-eRI I N31I BSSHII Bpml 



12660 



12720 



12780 



781 All^lllV^^^^^^^^ 

ccctgaccAcgcagacaccccgaaacacrcgcgVglVcccgct^ 

BsrBI Pvull «««?*"*f 

1284 1 Jl^Alll^^^^^^^^^ 

cgagcaccgcgccgacaaggaacaccVcVccVtgVc7ccc«^ ' 



840 



2900 



Banll 
BSP1286I 

Avail MspAlI BsaBI Sfcl 



II I : 

12901 

acgcgacgacccgcaccatVcVgVgVcVccggci^^^^ ^^'^^ 

Bce83I 

pst- Msir . . Figure 28S 

^^^r Smll Bsgl Gdllll ^ 

11 It 

12961 

cgccccgcaccaccaVgVcVcVgVgVcg^Vcl^Vg;^^^^^ ""^ 

SUBSTITUTE SHEET (RULE 26) 



aNSOOClO- <WO_..99026SSA1 * > 



wo 99/02658 



PCTAJS98/14239 



13031 



53/85 

Xcml ' 

Bpuiioa: \ 

ccaccccacgcccagcccgggcaagtcncacgcccgcaagacacaccacaccccitacgc ^ ^ ^^^ 
gacaaggtacgaaccggacccgcccaaaacgcgggcgc-rcacacggtacggggaacgca 

Fspl 

Ms II 

Nspl! Haei: 

- \ I 

ccccacagacaaggaggcaaagaccgaggggtcccacazgcgcacggcgccgaaggcgct 



13140 



agggcacctgcccccccacccccagccccccaagacgzacgcgtaccgcgacccrsacga 



Smll EC057I Bce83I 

I I I 

caccttgagcgacgacctgggcgcccaccgcaacgagcgcacccacaaggccgcgagcgc 

•3141 * ♦ ♦ " * * 

acggaacccgctgccggacccgcaaacagcgctgcccgcgcaggcgccccggcacccgca 

Apal 
Sanll 

Baall aspl236l 

BstHKAI 
Bspl286I 

Sacl ECOO109H 
BsrFI Acellll 1 1 

NgoAIV BPU1102II BsiEI 2stA?I I 

1 III ' I I 1 I 

gagccggcggcgcgagcccagcgaccgcgagccgacgcacagcccgcaaagggccccggc ^ ^ 



13201 



13261 



cccggccgccgcgcccgagccgccggcgcccgaccacgcgccggacgccccccgggaccg 

MspAlI Hln4I 3sp24I Haell 

t I I > 

tggcacgggcagcggcgacagagaggccgagtcccacwctgacgcgggcgccgacccgcg ^^^^o 

accgcgcccgtcgccgctacctccccggcccaggacgaaaccgcgcccgcgac&ggacgc. 



Apal 
BanXI 
Bspl286I 
Bsp24Xt 

BmglM 3sp24: 

BseSXtl Bsp2 4I I 

ECOO109Iin MspAlIl ! 

BspMIMM Bsp24I 9gll PvmIII I Bpml Bani 

I I I 1 I } i 'I * ' 

ccgggccccaagccgacgcgccccggaggcagccggggccggacccgggccggcggcggc 

3^3321 * • • * i-*****" 

gacccggggcccggccgcgcgggacccccgccgaccccggcccggacccgaccgccaccg 

BssHlI 
Bsp24I 1 
BSSHZI \ 

acccgcgcgcgccggcaacgtcggcggcgcggaggaacacgacgaggacgatgagcacga ^ ^^^^ 



13381 



13441 



L3501 



cgggcgcgcgcgaccgccgcagccgccgcacctccciacaccgcccccgctacccatgc- 
Scal 

Tad I sell 

1 1 I 
gccagaggacggcgagcactaagcggcgacgcctccgaccagacgacgcaagacg^^ ^^^^^ 

cggccccccgccgcccatgacccgccaccacaaagac-agcctactacgcccigcgctgc 
Psci 

Sfcl I BstDSI 

1 \ ' 
gacccggcggcgcgggcggcgcLgcagagccagccgtccggcctcaaccccacggac^ ^^^^^ 

ccgggccgccacgcccgccgcgacgccccggccggcaggccggaaccgaggigcccgccg 



13561 



Kaell 

' AflTTT 

Narl I ^^tJr\ 
Banll I PflMI BssHi: Mlu* 

I I I 1 > ' 

tggcgccaggccacggaccgcaccacgtcgccgaccgcgcgcaaccccgacgcgccccgg 



13620 

accgcggcccagcacccggcgcagcacagcgaccgacgcgcgccaggaccgcgcaaggcc 

33,pf""^ B3»Hxx Figure 28T 

III ' ' 

caaca«cc«cagaccaacc9gecctccgcaactet35aagcggc3«tccc33cgcgegc» 
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1362 1 - - • 13680 

grcgccggcgcccggccg9cc9aga9gc3ccaa9accc-c;c~ac:ag9gcc?;;c?c3c 

3sl£I eael 
9SSSI Pvul CdiXX 

t I I 

aaccccacgcacgagaaggcgccggcgaccgcaaacgcgrcgg::;aaaacag?gc:acc 

13681 13740 

tcggggcgcgcgcccccccacgaccgccagcacctgcgcgaccggcc'ccgtcccggcag 

PflllOSi 
ACCI I 
SC057II ; 
Fsel ( I I 
BsrFl I I t i 
NgoAIV I Ml 

I I III 

cggcccgacgaggccggcccggcccacgacgcgccgccccagcgcg-ggcccgitacaac 

13741 ♦ ♦ ♦ ♦ ^ * 13S00 

gccgggccgccccggccggaccagacgccgcgcgacgaagccgcgcaccgagcaacgccg 

BsrFI 

MspAlI BspGI : BsgX BscDSI 

I III I 

agcggcaacgcgcagaccaacccggaccggccggcgggggacgcgcgcgaggccgcggcg 

13801 ♦ ♦ 13860 

ccgcegccgcacgcccggccggacccggccgaccaccccccacacgcgccccggcaccgc 

B stDSI 

Nco: 

Scyl 
Banll! 

BssHII Bspl286II 
BssHZX I BsaXX I i 

i I I M 

cagcgcgagcgcgcgcagcagcagggcaacccgggccccacggccgcaccaaacgcccLc 

13861 ♦ ♦ 13920 

gccgcacccgcgcgcgtcgccgccccgctggacccgaggcaccaacgcgacccgcggaag 

SaeXX 
HspAlXI 

Taci BscDSX I I 

t III 
ccgagcacacagcccgccaacgcgecgcggggacaggaggaccacaecaaccc^gcgagc 

13921 ♦ — — ♦ 13980 

gacccacgcgccgggeggcegcaeggcgeecccgcccccccgacgcggccgaaacacccg 

BcsX 
I 

gcaccgcggccaacggcgaccgagacaccgcaaagcgaggcgcaccagcccgggccagac 

13981 ♦ ♦ ♦ 14040 

cgcgacgccgaccaccaccgaccccgcggcgccccaccccacaiggccagacccggcccg 

sfci 

HaeX I EcoNI 
ACCX SCUX IPSCX 3pulOX 
till II 
caccctccccagaccagcagacaaggcctgcagaccgtaaacccgagccaggcccccaaa 

14041 - ♦ 14100 

acaaaaaaggt-ctggccacccgccccggacgcccggcacccggacccggcccgaaagccc 

TthlllX 

Ban XX BstCXI 
RleAI AluNX BSP1286X BslEI RleAX I I 

II I I I I I 

aacccgcaggggccgcggggggcgcgggcceccacaggcgaccgcgcgaccgcgcccagc 

14101 - ♦ 14160 

ccgaacgcccccgacaccccccacgcccgagggcgcccgccggcgcgccggcacagaccg 

BsaHI Haei: 
t I 
ctgccgacgcccaacccgcgcccgccgccgccgctaacagcgccccccacggacagcggc 

14161 — - ♦ * ♦ 14220 

aacgaccgcgggccgagcgcggacaacgacgacga^caccgcgggaagcgcctgccaccg 

smai Avrxi Fiaure 28U 

Aval I scyl Hael ^^j-^^fcA*.^ w w 

III 1 
agcgcgccccgggacaeacacecaggccacccgccgacacigcaccgcgaggccacaggc 

14221 * 14280 

cegeacagggccccgcgcacggacccagcgaacgactgcgacaiggcgccccggcaccca 

NspX BssHIX 
t I 
caggcgcacgcggacgagcacaccctccaggagaccacaagtgtcagccgcgcgccgggg 

14281 - 14340 

gcccgcgcacacccgcccgcacgaaaggccccccaacgcccacagccggcgcgcgacccc 

BsrPX 

BpraX BspHX 1 
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1 I I 

caggaggacacTCgcagcccggaggcaaccezaaaccacccgccgaccaaccoacggcag 

14341 ♦ ♦ ♦ • ♦ ♦ 14400 

gccctcccgcgccegccggaeccccgccgggacccgacggacgaccggccggccgccgcc 

Oral BseRI 
BscYI PmeZ SseRZ iBsaAI 

I I . Ill 

aagaccccctcgccgcacagcctaaacagcgaggaggagcgcaLtttgcgccacgcgcag 

14401 ♦ ♦ ♦ ♦ 14460 

ccccaggggagcaacgcgccaaacctgccgctcccccccgcgcaaaacgcgacgcacgcc 

BspGI 
Haell I 

Bsgl Xcml i I 

i III 
cagagcgcgagccccaacccgacgcgcgacggggcaacgcccagcgcggcgccggacacg 

14461 * r 14520 

gccccgcacccggaaccggaccacgcgc^gcsccaccgcgggccgcaccgcgacccgcac 

BslSI 
Sael I 
EagI I 
Gdill I 

BssHII Nspl BsrFI I I 

1 ,1 III 

accgcgcgcaacacggaaccgggcacgcacgccccaaaccggccgcccaccaaccgccca 

14521 ♦ ♦ ♦ 14580 

cggcgcgcgccgcaccccggcccgcacacacggagcccggccggcaaacagccggcggac 

BslCI 
Eael I 
EagI t 
Gdill I 
Nod t Aval 
I I 1 

acggaccacccgcatcgcgcggecgccgcgaaccecgagcaccccaccaacgccaccccg 

14581 ♦ ♦ — ♦ 14640 

cacecgacgaacgcagcgcgccggcggcacccggggcccacaaagcggtcacggcagaae 

BSP1286Z 
Avail 
BmgZI I 
BseSZll 
Bani 1 1 I 
1 It t 

aacccgcactggccaccgccccccggcrcctacaccgggggactcgaggcgcccgagggc 

14641 ♦ ♦ * 14700 

ccgggcgcgaccgacggcgggggaccaaagacgcggccccccaagccccacgggccccca 

AlwNI 

3SCAPI 

I 

aacgacggacccccccgggacgacacagacgacagcgcgccccccccgcaaccgcagacc 

14701 ♦ ♦ ♦ ♦ ♦ ♦ 14760 

ccgccacccaaggagaccccgecgcacccgccgcegcacaaaaggggcgctggcgcccgg 

Haell KindlZI 
. • I I 

ccgccagagccgcaacagcgegagcaggcagaggeggcgctgegaaaggaaagc^cccgc 

14761 ♦ ♦ ♦ ♦ ♦ 14820 

gacgaccccaacgccgccgcgcccgcccgcccccgccgcgacgccccccccc.cgaaggcg 

sac 1 1 
MSP All! 

Hael " - Tthlllll Haell BacOSI II 

I I I I M 

aggccaagcagcccgcccgacccaggcgctgcggccccgcggccagacgccagcagccca 

14821 ♦ ^ ♦ ♦ ♦ ♦ 14880 

cccggcccgccgaacaggccagacccgcgacgccggggcgccagcccacgaccaccgggc 

Hlndlll Hin4l Bsal Bgll 
III I 
tccccaagcttgacagggcctcccaccagcactcgcaccacccgcccgcgcccgccgggc 

14881 ♦ ♦ * ♦ ♦ ♦ 14940 

aaaggcccgaaccaccccagagaacggccgcgagcgcggcgggcgggcgcggacgacccg 

BseRI psci Fioure 28V 

BseRI I S£cl i BspMl ITJ-^jWk^^ ^wv 

I I I . I. I 

gaggaggagcaeecaaacaacccgccgccgcagccgcagcgcgaaaaaaacccgcc cccg 

14941 — — , — ♦ ISOOO 

cceecccccacggaccegccgagcgacgacgccggcgccgcgccccLCCcggacggaggc 

Bbsl 
Suni I 
I I 

gcaccccccaacaacgggacagagagcccagcggacaagacgagcagacggaagacgtac 

15001 ♦ — ♦ ♦ 1S060 

cgcaaagggccgccgeeccaccceccggaccacecgccccacccacccaceccccgcacg 
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BslHKAI 

BSP1286I e3p24: BslEI 

• ' I I 

1 e ^c^caggagcacagggacgtgccaggcccgcgcccgcccacccgtcgccaaaggcacgac 

-5^^^ * ♦ ♦ ♦ 15120 

cgcgcccccgtgcccccgcacggcccgggcgcgggcgggcgggcagcagcccccgcgccg 

RleAl 

MspAlI I 

Bsp24l II Hin4I Drdi 

i H I I 

cgtcagcggggcctggtgcgggaggacgacgacccggcagacgacagcagcgccccggac 



1S121 



gcagccgccccagaccacaccctcccgccaccgagccgtccgctgccgccgcaggaccca 



IS 180 



Fspl Oral 
I I 
, _ ccgggagggagcggcaacccgcccgcgcacccccgccccaggccggggagaacgccccaa 
^5181 ^ ^ , ^ 5^5240 

aacccccccccaccgcigggcaaacgcgcggaagcggggcccgaccccccccacaaaacc 

Ban! 
BSCXZ 
BscOStl 

NCOll 

Scyll 
Scyl Haell I 
I III 

, r-* • ********a*9cacgacgcaaaacaaaaaacccaccaaggccacggcaccgagcgtcggc 
^5241 * 153Q0 

cccccccccccgcaccacgccccaccccccgagcggccccggcaccgcggcccgcaacca 

ECOO1091 
PspSiX 
Sse8647I 

BssHII BseRI I 

I I I 

, . ccccccgcactccccccagcatgcggcgcgcggcgacgcacgaggaaggccctccccccc 

15301 ^ ^ 15360 

aaagaaca caaggggaa c cac a cgccgcgcgccgc ta ca c act cc c cccaggaggaggga 

Haell 
BsaHI I 
Nan I 

Pf 111081 Banll I Haell 

f Ml I 

cccacgagagcgcggcgagcgcggcgccagcggcggcggcgctgggcccccccctcgacg 

15361 - ♦ ♦ 15420 

ggaLgcccccacaccacccgcgccgcggccaccgccgccgcgacccaagagggaagccac 

BanI 
Saclll 
MspAlIl ! 

BspGI BSCDSI MfKpnl BspMI 
I I II t I t 
cccccccggacccgccgcccgcgcccccgcggcacccgcggcccaccggggggagaaaca 
15421 — - ♦ ^ 15480 

gaggggacccgggcggcaaacacggaggcgccacggacgccggacggcccccccccccgc 

BanI MslZ SexAt 
I 1 I 
gcacccgccaccccgagccggcacceccactcgacaccacccgcgcgcacecggcggaca 
15481 ♦ ♦ ^ 15540 

cgcaggcaacgagacccaaccgcggggacaagc^gcggcgggcacacacggaccacctgc 

Hindi ■ 
•Ahdl \ 

Haeivi I Haeiv PshAI 

Hin4Il I Hln4l BstDSl I 

11 I I II 
acaagccaacggatgcggcacccccgaaccaccagaacgaccacagcaaccctctgacca 

15541 • ♦ ♦ ♦ 15500 

cgcccagccgectacaccgcagggacccgacggtctcgccggcgccgccgaaagaccggc 

smai ^H^eii" Flgure 28W 

SCcI Aval I Kin4Z 1 

I t t II 

cggccattcaaaacaacgaccacagcccgggggaggcaagcacacagaccaccaacctcg 

15601 ♦ ♦ ^ ^ 15660 

gccagcaagccccgccaccgatgtcgggccccccccgcccgcgtgcccggcagccagaac 

BslEl 
Ahdl 
BslEI 
Bsawi I 
BsrFI I 
Haeivi 

K1R4I1 



SUBSTITUTE SHEET (RULE 26) 



WO99/02658 



PCT/US98/14239 



57/85 



PinAIl ) Bmrl Nspl 

Mil I 
acgaccggtcgcactggggcggcgacccgaaaaccaccccgcacaccaacacgccaaacg 

15661 -- • 15720 

cgccggccagcgcgaccccgccgccggacccccggtaggacgcacggccgcacggcctac 

Xmni 
I 

cgaacgagcccacgcccaccaacaagcccaaggcgcgggcgacggcgccgcgcccgccca 
-5721 15780 

acccgcccaagcacaaacggctacccaaactccgcgcccaccaccacagcgcgaacggac 

Acelll Taqi: Aval 

» 1 
ccaaggacaaccaggcggagccgaaacacgagcgggtggagczcacgccgcccgagggca 

15781 - . 15840 

gaccccccccagcccaccccgaccciacgcccacccaccccaagcgcgacgggcccccgc 

Bsi£I BsiHKAI 

Bsal pvul 3sp:236I 
I I 1 
accaccccgagaccacgaccacagaccwtacgaacaacgcgaccgcggagcaccacccga 
15841 * * -.--^ 15900 

cgacgaggccctggcaczggcacctggaacacccgccgcgccagcaccccgcgacgaacc 

EcoS'i 

j 

aagcgggcagacagaacggggccccggaaagcgacaccggggcaaagcccgacacccgca 
15501 ♦ ^ 15950 

cccacccgcccgtcccgccccaagacctcccgccgcagccccatcccaaaccgcgggcgc 

Bmrl Tthllll 
I I 

accccagaccggggcccgaccccgccaccggccccgccacgcccggggcacacacaaacg 
2-5961 ♦ * ♦ ^ 16020 

cgaagcccgaccccaaaccggggcagcgaccagaacagcacggaccccacacacgcccgc 

aagccccccacccagacaccaccccgccgccaggacgcggggcggaccccacccacagcc 

16021 • ♦ ♦ ♦ ♦ ♦ 16080 

ctcggaaggcaggcccgcagcaaaacgacggccccacgccccacccgaagcgggcgccgg 

RleAl 
Taqll I 
BpulOI II 
t I I 

gcccgagcaacccgccgggcacccgcaagcggcaaccctcccaggagggccccaggacca 
16081 — * > ^ 16140 

cggacccgccgaacaacccgcaggcgcccgccgccgggaaggcccrcccgaaaccccagc 

Pflll08l Mmel Bpml BsaHI Bgll 

I I I II 

cccacgatgacccggagggcggcaacacccccgcaccgccggacgtggacgcccaccaog 

16141 - 16200 

ggacgccaccagaccccccaccacrgcaagggcgcgacaacccacacccgcggacggccc 

Bcsl 

I 

cgagcccgaaagacgacaccgaacagggcgggggcggcgcaggcggcaocaacagcagtg 

16201 r*-^.- ♦ * ----- * * 16260 

gcccgaaccccccaccgcggcccgccccgcccccaccgcgcccgccgccgccgccgccac 

SacII Barn 
Earl MspAiri HsrDIi 

MspAir I BSCDSI It Mmeli 

.11 111 li 

gcagcggcgcggaagagaaccccaacgcggcagccgcggcaacgcagccggcggaggaca 

16261 * * * ♦ 4. 16320 

cgccgccgcgccctcccccgaggccgcgccgccggcgccgccacgccggccaccccccgc 

Beg I BseRl 
BbvCI I Bbvci I 



BpulO t 

Bcgl Bgll i 

i I I 

cgaacgaccacgccacccgcggcgacacccccgccacacgggccgaggagaagcgcgccg 

16321 ♦ ♦ * 16380 

acccgccagcacggcaagcgccgccgcggaaacggcgcgcccgacccccccccgcgcgac 



BpulOI 1 
BssHZI I t 

I 



BslEI 
Eael 
£agl 
GdllX 
MspAlII 
I I 



Fspl 
MspAlI I Ava! 

! I I 

aggccgaagcagcggccgaagccgccgcccccgccgcgcaacccgaggtcgagaagcccc 

16381 - ♦ - 16440 

cccggccccgccgceggcctcgacggcgggggcgaegcgcLgggecceagccccccggag 

Bsawi 
BsrFI 
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PinAI Bcl: rcoNI 



16441 A^A^^A^^*ll^5^^^^ 

tcctct-cggccaccagcccggggaccg-ccrVcVccgUrUcgcgcc^^ 



1S500 



Ban I 

Taqll ! 
MSPAII { I 

Smrl Hci£i:i IlKpnl 
I I 



I 



acccgziaccgccgcggaagcgggccacggcgccgaccaccgaacgcacglcgatgccgc 



16360 



16680 



16561 All'A?/^"^^^^^ 

c ?^5agc ciggcc t caggcgagc accVgVgVcsral^glgVggacc^ ^ ^ ^ ° 

BsrSI 

16621 A^^A^J^^A^fA^'l^^^^ 

gccwcgcccagacgaccagcaacggtccgcacrlcgrcVr^gggcIc^^^ 

Kaell BsiHKAI 
asaKI f Bspl286I 
Acelll Nan I BseSI i 

Bsawi I Banli I ApaLI I I 

I I Ml lit 

16681 A'Jll^JA^A^^^^^ 

gcgcggcccagccgctgaaaggccaccacccgcggcccgacaacgggcacgcgaggccc^ 
Acci 

OrdI I Ecll 
( I I 

16741 5JllVJA*l^\^^^^^^ 

cgaagacgccgccggcccggcagacgagggccgagtaggcggccaaacgglglglc^ 

afi*?if BSSHII 
Aiii-I- Aval Ordll PflMI i 

> i ! I • ! 

15 80 1 ^^^^^^J^^^^^^^}^^^J^'3'^^^^<:<^^<3^^'^ c cggcgcgcccgccagcccccaccacca 

cgcacaagccagcgaaagggcccctggcccaaaaccgcgcgggcggtcgggggcggtagc 

iSi? MspAll 

Bcgl iFspI 



16740 



16800 



16860 



I 



I I t 



16861 AA*L^JAi\^^^^ 

ggcggcagccacccctgcaaggacgagagtgcccagcgccccgcgacggcgacgcgctgc 

BseRI 
acgl I 

I I BsaHI BsaH: Aarl BspMZ 

16921 5J?All^A^A*^^^^^ 

cgcagcccccccaggccgcccaccggcaatgaccgcggcctgcggcgtgglc^^ 

ECO0109I 
I 

16981 A\l\"A^5A*l^^^^ 

aaacgccccgggacccgcatcagagcggcgcgcaggacagctcggcgcgaaaaacccgcc 
Nspr TchlUli ECOO109I TChlXiri 

' I I 

17041 A'^A^Al'^A^A^AA^^^^^^ 

cgcacaggcaggaacacagcgggccgccaccgcgcccgaccccggacgcgllgggcccgt 



16920 



16980 



17040 



17100 



BSP1286I 

Bmgi 1 , 

Bsesi i Figure 28Y 



--m'^'T ^ ^.Haeir Dram Mmel j , 

*^hl*l4.I EC047III I 3sbl Bmrll Taqll i ( 

* ' > t M III 

cccacaaaccgccccggc tccccgcgaggcc wcgcgggUlcg 
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BC9I 
aslEX 
£ael 
Eagi 
Gdill 

BssHII BssHII NOCI 

t t I 



Bmrl BsaHI 

1 1 



accgcgcgccccggggcgcgcacaaacgcggccgcaccgggcccaccaccgccgacgacg 

17X61 ♦ - ♦ ♦ ♦ 17220 

cggcgcgcgggaccccgcgcgcgcccgcgccggcgcgacccgcgcggcggcagccaccgc 

Xcml 

Bcgl 1 BssHII BseRI 

II II 
ccaccgacgcggcggcggaggaggcgcgcaaccacacgcccacgccgccaccagtgccca 

17221 ♦ ♦ ^ » 17280 

ggcagctgcgccaccaccccccccgcgcgccgacgcgcgggcgcggcggcggccacaggc 

Eael BscOSX Banit BsraBX 

Gdill Xcml I BSP1286I Haell Earl \ 

I 11 ill! 

cagcggacgcggccacccagaccgcggcgcgcggagcccggcgccacgccaaaacgaaga 

17281 ♦ ♦ ♦ * ♦ ♦ 17340 

gccacccgcgccggcaagcccggcaccacgcgccccgggccgcgacacgaccccaccccc 

ecll Bgll SCSI 

I ' II 

gacggcggaggcgcgcagcacgccgccaccgccgccgacccggcaccgccgcccaacgcg 

17341 * * ♦ ♦ 17400 

ccgcegcccccgcgcaccgcgcagcggcggcggcggccgggccgcgacggcgggccgcgc 

BslEI 
EaeX I 
Eagi t 
Cdill I EaeX Bgll 
BsrFI I 1 Gdlll Sfll BsrBI 
III t 1 I 

cggcggcggccccgcccaaccgcgcaegccgcaccggccgacgggcggccacgcgggccg 

17401 — ♦ . ♦ ♦ — ♦ ♦ 17460 

gccgcegecgggacgaaccggcgegcgcagegcggccggccgccegccggcacgcceggc 

BslEI 

Sac I I EaeX 
MspAlIl Eagi 
BSCOSX I I 3spl286X Gdill 

Eael 1 I I Bmgl t NocX 

Gdlll I li BseSI I BspGl BsrBI t 

I I I 1 11 I M 

cccgaaggccggccgcgggcaccgccaccgcgccccccaggcccaggcgacgagcggccg 

17461 ♦ — — ♦ ♦ • 17520 

gagcccccgaccggcgcccacaacagcgacacggggggcccaggcccgccgcccgecggc 

EaeX 
Gdlll 

Sac 1 1 Ahdl 
MspAlII HaelVI A£1III 

BscOSI tl Hin4II TaqlX I 

III It i I 

ccgcagcagccgcggccaccagcgccacgacccagggccgcaggggcaacgcgcaccggg 

17521 ♦ ----- ♦ ♦ 17580 

ggcgccgccggcgccggcaaccacgacaccgagccccagcgcccccgccgcacacaaccc 

Bspl286X 
Bmgl I 
BseSX 1 Fspl 
lit 

cgcgcgacccggccagcggcccgcgcgcgcccgcgcgcacccgccccccgcgcaaccaga 

17581 ♦ ♦ ♦ ♦ ♦ ♦ 17640 

acgcgctgagccaaccgccggacgcgcacgggcacgcgcgggcggggggcgcgccgaccc 

BssHII 
BclVX I 
MspAlI [ \ 

I t I 

ccgcaagaaaaaaccacccagacccgcaccgccgcacgcacccagcggcggcggcgcgca 

17641 ♦ — - * ♦ ♦ 17700 

aacgcccccccctgacgaacccgagcacgacaacacacacaggccgccgccgccgcgcgc 

Earl BglXI 
BpmZI BscYI 
I I I 
acgaagccacgcccaagcgcaaaaccaaagaagagaegccccaggccaccgcgccggaga 

17701 ♦ * ♦ — ♦ ♦ 17760 

cgccccgacacaggcccgcgctccagccccccccccacgaggcccagcagcgcggceecc 

ItS^ Figure 28Z 

I 

cctacggccccccgaagaaggaagagcaggaccacaagecccgaaagecaaagcgggcca 
17761 ♦ ♦ ♦ ♦ ♦ ♦ 17820 
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17821 



17881 



17941 



agacaccggggggcccccccctccccgccccaacgcccggggcciccgaccccgcccagc 

Bsgl 

aaaagaaaaagaaagacgatgacgacgaacccgacgacgaggcggaaccgccgcacgcca 
ctccccctttcttcccaccaccaccacccgaaccgctgccccaccttgacgacgtgcgac 

AfllXI 
Mlul 
Hinczi 1 
AccZl I 

Sail I I ! Afllll Bani 

I M I t I 

ggcgcgggcccgccgcccacgccacccccccagctgcgcaccccgcacaaaacgccgggc 

Haell 
EC047III 1 
t I 

gcaccaccgcagcccttacgcccggcgagcgccccacccgcacccacaagcgcgcgcacg 



17880 



17940 



18001 



18061 



18121 



laiai 



cgtggcggcaccagaaatgcgggccactcgcgaggcgggcgcggacgcccgcgcacatac 
EcoOlO?! 

PapSII Aval 
3se3S47l BspMt Bce83r 

Tchlllll t smll I Hael HaelXI 
II III M 

a cgaggtgcacggcgacgaggacccgcccgagcaggccaacgagcgccccggggagc c c g 

caccccacacgccgccgcccccggacgaacccgcccggccgctcgcggagccccccaaac 

Bspci 

Nspl MspAlI I Bsbl 

I ( i f 

cccacggaaagcggcacaaggacacgccggcgtcgccgccggacgagggcaacccaacac 

ggacgccccccgccgcacccccgcacgaccgcaacggcgacccgctcccgctgggtcgtg 

AlwMI 
Bsc API 
Bcsl Aarl t 
Sfcl BSCAPI I 
BspMI iPscI 1 I 
I i I f I 
ccagcctaaagcccgtaacaccgcagcaggcgctgcccgcgcctgcaccgcccgaagaaa 

gaccggaccccgggcaccgcgacgccgcccacgacgggcgcgaacgcggcaggccccttt 

BanI 
TaQll I 

MspAlI I • 

Banl Pvuiz I iKpnl 

I I i ; I 

agcgcggcccaaagcgcgagcccggtgacc-ggcacccaccgcgcagccgacggcaccca 

ccgcgccggacctcgcgcccagaccaccgaaccgcggg^ggcacgtcgaccaccacgggc 



18000 



18060 



18120 



18180 



18240 



Haeri AlwNI 8spl2a6l 
asgl I PflMI BscDSI Avail 

' ' ' I II 

*9cgccagcgaccggaagacgccccggaaaaaatgaccgcggaacctgggccggagccca 

18241 ♦ ♦ • 13300 

ccgcggccgccgacctcccacagaaccccccctaccggcaccccggacccgaccccgggc 

Tthlllll 
Haeix 

BspMI BsaKI 
Bpral I Narl 
Bael I Banli 
Gdlll I Aarl I I 

II III 



Bmrl ascDS: 
I 



18301 



18361 



aggcccgcgcgcggccaaccaagcaggtggcgccgggaccgggcgcgcagaccgcggacg 
cccaggcgcacgccggctagtccgcccaccgcggccctgacccgcacgtccggcacccgc 
Bsgl Xcal 

cccagacacccaccaccagcagcaccagcaccgccaccgccacagagggcacggagacac 
aagcccacgggcgatggccaccgcggccataacggcggcggcgccccccgcacctccgcg 



18360 



18420 



Sad I £aQl I 1 

MspAi: MspAlII EagX I i 

BbvCI I BSCOSI I I GdlZX I t 

sussrmm sheet (rule 28) 
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DrdI BpulOI I Ecil I It Bsl£I Noti i i 

> II I t i t I I I I 

aaacgrccccqgccgccccagcggcggcggacgccgcggigcaggcggccrri^cggccg 

13421 * -- 13430 

cctgcaggggccaacggagtcgccaccgcccacggcgccacgcccgccaccgacgccggc 

BSCDSI 

I 

cgcccaagacccccacggaggcgcaaacggacccgiggacgcrccgcgcrcraccccccc 

18481 * * * • 18540 

gcaggccccggagacgcccccacgcccgcccgggcacccacaaagcgcaaagzcgggggg 

Haell Haell 
BsaHI I BsaHI \ 

Marl I Narl I 

Ban! I I Banll iBssKII 

I t I lilt 
ggcccccgcgcggctcgaggaagcacggcgccaccagrgcgc^ac-^cccraacangccc 

13541 * ♦ ^ 13500 

ccgcgggcgcgccaagcccctccacgccgcggcggccgcgcgatgargggmacacggg 

BsrDI 
I 

cacacccccccaccgcgcccacccccggccaccgcggccacacccaccgccccagaagac 

18601 ♦ ♦ ♦ ♦ ♦ 13660 

acgcaggaaggcaacgcggacgggggccgacagcaccgacgcggacggcggggccccccg 

Bbsl BsaHI DrdI I 

1 I I 

gagcaaccacccgacgccgaaccaccaccggaacccgccgccgccgtcgccgccgccaac 

18661 ♦ + * 18720 

cccgccgacgggccgcggcccggcggcgaccccgggcggcggcggcagcggcagcggccg 

Nru: ECOO109: 

Bcgl Fspl Bcgl; PspSi: 

I I t I i 

ccgcgctggccccgacccccgcgcgcagggtggctcgcgaaggaggcaggaccccggcgc 

18721 ♦ ♦ ♦ ♦ ^ 18780 

ggcacgaccggggccaaaggcacgcgccccaccgagcgccccccccgcc:tgggaccacg 

BsrFI 

BssHll Dral t Ordll 

t tit 
cgccaacagcgcgccaccaccccagcaccgctcaaaagccggcccccgcggsccccgcag 

18781 ♦ ♦ ♦ * 18840 

aeggccgccgcgcgacggcggggccgcagcaaacczceggccagaaacaccaagaacgcc 

Aarl BspMI BanI Bsml 

III I 
acacggcccccacccgccgcccccgccccccggcgccgggaccccgaggaagaacgcacc 

18841 ♦ • ♦ 13900 

cacaccgggagcggacggcggaggcaaagggccacggccccaaggccccrccccacgcgg 

BscDSl 
Psdil 
Eael I I 
Gdill I I 
BsrPi I I I 
. • NgoAIV I I I 

Ea«I I I II Mspl BsrFI 

Gdlll I I 11 Bgll SphI ?sp: SgrAI 

I I I 1 I I I I i 

gcaggaggggcacggccggccacggcccgacgggcggcacgcgccgcgcgcaccaccggc 

18901 ♦ ♦ . ♦ * 18960 

caccccecccgcaccggccggtgecggaccgcccgccgcacgcageacgcgcggcggccg 

Nspl 

BssHII sphI BClVI 

I .1 I 

ggcggcgcgcgc cgcaccgccgca tgcgcggcggcaccccgcccc i cci ca c cccaccga 

18961 ♦ ♦ ♦ ♦ . 19020 

ccgccgcgcgcagcgcggcagcgcacgcgccgccacaggacggggaggaacaaggigacc 

Bspl286l 
Bmgl 
BseSI 
Haell 

SacIX BsaHI \ 
MspAlIi Narl I 

BSCDSI 1 1 Banll I 
III Ml 



Figure 28BB 



BsCDSI Hael Bgll 

t I I 

ccgccgcggcgaccggcgccgcgcccggaaccgcacccgcggccccgcaggcgcagagac 

19021 ♦ * » . ♦ . 19080 

agcggcgccgccaaccgcggcaegggccctaacgtaggcaccggaacgccegcgcccccg 

Nspl BspGI 
I I 
accgaccaaaaacaagccgcacgcggaaaaaccaaaacaaaaagtecggacceccacgcc 

19081 ♦ ♦ 19140 

cgaccaacccccgcccaacgcacacccccccagccccaccccccagacccgagagcgcga 



SUBSTITUTE SHEET (RULE 26) 



wo 99/02658 



PCT/US98/14239 



62/85 



19I4I 



19201 



19261 



B£)SI BsriB: 
I : 

cj^wiggccccgcaaccacctcgcagaacggaagacaccaacctcgcgcccccggccccg 

gcgaaccaggacaccgacaaaacaccctaccctctgcagccgaaacgcagagaccggggc 

3anl BsrBI 
ECORV I Eco57i I 

II I i 

gccgtgcsgagcgcgggcaagcaccccscgaccgcaccatagccgcggccgtcacacccg 

Haell Banll 
BsaHI I BSP1286I 
Narl IMspAlI I 
Banll I Pvvli t BsrBI Apol 

II I I , , , 

ggtggcgcccccagccggggcccgccgiggagcggcattaaaaaccccggitccaccgcc 
ccaccgcggaagccgaccccgagcgacaccccgccgcaaccctcaaagccaaggcggcaa 

BbvCI 

Hael BpulOI 
Scul Haal AlwNII 



19200 



19260 



19320 



r 



I II 



19381 



19441 



19321 

cccctgacaccgccgccccggaccccgccgccgcg-ccggcccacgaccccciacccaac 

Hael 

Hael I I 

* 111 

ccccccgccctaaaggccgctccccaccacctaccggaccggagaccgcaaccgccccac 

Hael 
Msci 

Eael I Bgll Best Hindlll 
I I I I I 

^gcggacccggccaaccaggcagcgcaaaataagaccaacagcaagcccgacccccgcccc 

cacccggaccggccggcccgccacgtcccaccccaaccgtcacccgaaccaggggcggga 

# Bpral 
BslEZ 
BscOSZ 

BseRIt BsaXI 

Eael II Hin4II 

Eagi II TthllllM 

Gdill I I Bsaxii I ( 

BsrFI I II Hin4Illl 

I I I I I I I I 
^f^^^^^^^^^^^^^^^^^^^^ccgzggagacagzgzczccag^ggggcgzggcgaAaag 

gggcacccccccggaggcggccggcacccccgccacagaggcccccccgcaccgctcctc 

Hin4I SunI 

I I 

cgcccgcgccccgacagggaagaaaciccggcgacgcaaacagacgagccicccccgcac 

gcaggcgcggggctgcccccccttcgagaccaccgcgcccacccgcccggagggagcacg 

BstAPI Bsp241 

Haell BscOSI 

scull Ncol 

Bsp2 41 II Scyl 

BplI BseRI I II xcml I I Bsawi 

' I I I I II.. 
gaggaggcactaaagcaaggcccgcccaccacccgccccaccgcgcccacggccaccgga 



19501 



19S61 



19380 



19440 



19500 



19560 



19620 



19621 



19681 



19680 



cccccccgcgaccccgctceggacgggtggcgggcagggcagcgcgggcaccgacggccc 

BspCI ■ BspMI 
I I 

?^?^^yjy^^^^^^^^^^^^^9^^^^9czggacctgcctcccczcgccgacaccc^geag 
cacgacccggccgtgcgcgggcaccgcgacctggacggaggggggcggctgcgggccgcc 

Taqii BsiEi Flgure 2 8CC 

I I 



19740 



cccggacacgacggcccgggccggcggcaacaacactgggcaggaccggegcgcagggac 



19800 



RsrZX BslEt 
MspAlI I pvul 
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I I i 

19801 Jl^Jl^L^Al^^^^^^ 

gcggcgcggcggccgccaggcgccagcaacgccgggcacWgccaccgccgaccgcccc^ 

Haell ECOS7I 

19861 A^'ll^A^*^^^^^^^ 

cgcgacctgccgcagcacccagacccccacgccagggaccccgcggccgccacgalgacc 

^^^^^f Aceiii epuii02r 

I ( 

19921 

tatcgaccgcacagcacacacacagcacacacgcaggcacagcggcggcccccccgacga 

BssHll 

1 BC3I 

I 

19981 5A^fJl'fJ?J\^^^^^ 

cccggcggcgcgcgggcgaaaggccccaccgacggggaagccaccacggcgccaccagaa 

Smal 
Aval I 
Banll ( 

Nspl Aval BsaHI BpulOI i i 

I i I lit 

acacgcacaccccgggccaggacgccccggagcacccgagccccgggctggcgcagcccg 



19860 



19920 



19980 



20040 



20041 



cgcacgcgcagagcccggccccgcggagccscacggacccggggcccgaccacgccaaac 

Haaix 

EC057II Bsgl BscOSI ^nl] ! 



20100 



20101 



20161 



•cj:j^cgc^^^ 

gggcgcggcggctccgcacgaagtcggaccraccgcccaaaccctcggggcgccaccgcg 

Bsawi 
BsrFI 

Dralll pinAI msII 



gacgcgcgccgcaccggcgcccggccagggccgcaaaccgcgacgccaagcagggacacc 



20220 



a*-ivT Taqll 

RleAI I 

' - I I 

20221 A^J?A^A^5".^^^^^ 

cggcacccccacgacgcacgagcacgccccgcgccaagcgggatcgacacccaccattgg 

Apai 
BanZt 
Bspl286t 
BrogZ 
BseSt 

Sacll EcoO109tl 
„ ■ MspAlU ECOO109III 

BspGI BsaAI BSCDSI I : BspGI \ I | 

* ■ I i I I III. 

20281 Jl^jL^5A^55A*l*^^^^ 

cacacgacctgtaccgaaggcgcacgaaaccgcaggcgccgcacgacccgcccccgggac 

BSP1286I 
BmgX I 
BseSI 1 

Btsi styl Bani I I 

' I 1 I i 

20341 JL""A*5f^"*""'^^^ 

gaaaatccgggacgagaccgtgacggatgccgcgggaccgagggcccccacggggtctag 

Earl 

20401 

gaacgcccaccccaccccgacgacgacgagaaccccacctggatcccccccccccgccac 



20280 



20340 



20400 



20460 



Acci «"r? Figure 28DD 

Bbsl 1 BPU1102I BsaAI Banll 

II I I II 
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acaacgaagacgaagcagacgagcaa<5C^:gagcagcaaaaaacccacgtacccygycagg 

20461 ♦ - - ♦ 20S20 

cgccgcccccgccccaccegcccgcccgacccgccgccccccgagcgcacaaacccgccc 

Kaell sspl 
I I 
cgccccaccccggcataaacaccacaaaggagggcacccaaacaggcgccgaaggccaaa 

20521 * ♦ * 20580 

gcggaacaagaccacacccacaacgcccccccccacaagcccacccacagcccccagccc 

TthlllXI EcoNI 
I I 
cacccaaacacgecgacaaaacaccccaacccgaaccccaaacaggagaaccccagcggc 

20581 • ♦ ♦ ♦ ♦ ♦ ♦ 20640 

gcggacccacacggccaccccgcaaagccggacccggagcccacccccccagagccaeca 

MspAi: 

vspi Pvull 
I I 

acgaaaccgaaaccaaccacgcagccgggagagcccccaaaaagaccaccccaacgaaac 

20641 ♦ ♦ ♦ ♦ . ♦ ♦ 20700 

cgcctcgactctaatcagcacgicgacccccccaggaacccccccgacggggttacctcg 

Ndel RleAI BsmI 

i I I 

cacgtcacggeccacacgcaaaacccacaaacgaaaacggagggcaaggcaccccegcaa 

20701 — — -> -* ♦ 20760 

gcacaacgccaagcacaegccccgggcgcccacccccaccccccgccccgcaagaacacc 

agcaacaaaacggaaagccagaaagceaagcggaaacgcaaccccccccaaccaccgagg 

20761 ♦ — • ♦ ♦ . — ..^ 20820 

tcgccgccccaccccccgaccc^ccagcccacccccacgctaaaaagagccgacgacccc 

BsrGI 

BslEI BsrDI Tad 

II I 
cgaccgcaggcaacggcgacaacccgacccccaaagcggcaccgcacagcgaagacgcag 

20821 ♦ ♦ ♦ ♦ ♦ 20880 

gccggcgcccgccaccaccaccgaaccgaggaccccaccacaacacgccacccccacacc 

NspX BssSi 
I I 
acacagaaaccccagacacccacaccccccacacgcccaccaccaaggaaggcaacccac 

20881 — ♦ ♦ * — ♦ . ♦ 20940 

cacacctccggggcccgcgagcacaaagaacgcacgggcgaeaaccecccccaccgagcg 

Kaal 

scul 

Bgll 1 BsrDI 

I I 1 

gagaaccaacgggccaacaacccacgcccaacaggcccaatcacaccgcccccagggaca 

20941 ♦ 21000 

ctcccgaccacccggccgccagacacgggccgcccggaccaacgcaacgaaaacecccgc 

Taqll 

I 

accccaccggcccaacgcaccacaacagcacgggcaacacgggcgccccggcgggccaag 

21001 ♦ ♦ ♦ 21060 

caaaacaaccagaccacacaacgccgccgcgcccaccacacccacaagaccgcccggccc 

BsmX 

TChllllX I TChlllXX 
I I I 
caccgcagccgaacgccgccgcagacccgcaagacagaaacacagagctcccacaccagc 

21061 ^ ♦ 21120 

, gcagcgccaacccacgacaacacccaaacgccccgcccccgcgccccgaaagcacggccg 

OrdXX 

SexAX Kind I 

I I 
ccccgcccgaccccaccggcgacagaaccaggcacccccccacgcggaaccaggccgccg 

21121 ♦ - 21180 

aaaacgaaccaaggcaaccaccaccccggcccacgaaaagacacaccccagcccgacaac 

acagccacgacccagacgccagaaccaccgaaaaccacggaaccgaagacgaacccccaa 

21181 ♦ - * 21240 

cgccgacaccaggcccacaaccccaacaaeccccagcaecccgacccccacccgaaggcc 

EC057I BmrX vapl scyl 

111 I 
accaccgccccccaccgggaggcgcgaccaacacagagacccccaccaaggcaaaaccca 

21241 - - 21300 

caacgacgaaaggcgaccccccacaccaaccacgcccccgagaacggccccaccccggac 

ScoNX SCeX XpoX MmaX FlQUlTe 2 SEE 

I 111 
aaaeaggccaggaaaacggacgggaaaaagacgccacagaacccccagacaaaaacgaaa 

2X301 ♦ - ♦ 21360 

cccgteeagcccecccacccaccccccccccacgacgccccaaaagcccacccccacccc 
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BsCOSI 
Ncol 

sty I PflM: Apoi 

I i I 

-.1361 * ♦ ^ - ^ 21420 

acccccaacccccaccaaaacggcaccctcagccagacccacggitggacacccczccaa 
Haell 

Tad EC047III I j4jnel TacI 

^ II I t 

^*=*^^5'^*^^<^<=aacacagcgccgcawLcgcccgacaagctaaagcacagcccccrraacg 
21421 • ^ ^ ^ 21480 

aggacacgaggccgcaccgcgacacaaacgggccgcccgactccacgccaggaaggctgc 

- Tchlllll Smal 
APoI Mmel Pfllioai I v/al i 
' III II 
-"^"ttcctgacaacccaaacaccracgaccacatgaacaaacgagtggtggcicccg 
2x481 ♦ - 21540 

acccccaaagaccaccgggcicgcggacgccgacgcacc.gcccgctcaccaccgagggc 

BsiHKAl 

Scyl BSP1286I Tthllil 
I I I 
sgtcagcggaccgccacaccaaccccggagcacgccggccccccgaccacacggacaacg 
21541 ♦ ♦ ♦ ^ 21600 

ccaaccacccgacgacgcaaccggaaccccgcgcgaccagggaaccgacataccrgccgc 
xcml Hae! 

Hinc^I Msir IBsrOI I BsrBI 

' I I f { I 

^<=**cccacctaaccaccaccgcaacgccggcccgcgccaccgcccaacgctoccgggca 
21501 * ♦ ♦ ^ 21660 

agccgggcaaaccggcggcggcgccacgaccggacgcgacggcgagccacaacgacccgc 

Bspl286l 

Bmgl 1 Bani 

BsrOI BseSI I MslI I 

I II II 

^^Mccgctacgcgcccccccacacccaggcgccccagaagccctccgccaccaaaaacc 
21661 ♦ * ♦ ^ ^ 21720 

caccagcgacacacgggaaggcgcaggcccacggagcccccaagaaacggtaacicccgg 

PflllOBI 
Eco57i 1 
Banll I I Hindi 
BSP12861 I I Hpal Drdll 

ill II 
^^*^^^*^-^^^^^^5^9cccacacacccacgagt:ggaaccccaggaaggacgccaacacgg 

2^^21 ^ 21780 

aggaagaggacggcccgagcacgcggacgcccaccccgaagccctccctacaatcgcacc 

Avrll 
scyi 
Banll I 
BslHKAI I . 
Bspl2a6I 1 
Psd r I 
Sfd I Sad I BSU36I Hindi 

< I J I I t 

ccccgcagagccccccaggaaacgacctaagggccgacggagccagcaccaagczcgaca 

2^^^^ " ♦ — - * ♦ 21840 

aagacgtcccgagggaccccccaccggacccccaaccgccccggccgcaacccaaaccac 

BscOSI 

Ncol Bsaxi smll 
Styl Bsbl i RleAl I Hael 
i II III 
gcacctgcccccacgccaccwccctccccacggcccacaacaccgcccccacgctcgagg 
2^841 - — ♦ ♦ 21900 

cgcaaacggaaacgcggtggaagaaggggcaccgggcgccgcggcggaggcgcgaactcc 

BsaXI 

Bce83l Hin4II Sell 

I It i 

ccacgcccagaaacgacaccaacgaccagccccccaacgaccacccccccgccgccaaca 

♦ ♦ — 21960 

ggcacgaacccccgccgcggccgccggccaggaaaccgccgacagagaggcggcggccgc 

Bspl286I _ , 

„ , I Pxgure 28PF 

NSPI BseSI I Bmrl ^ ^^^c 

' . II I 
^ tffccccaccccacacccgccaacgccaccaacgcgcccacacccacccccccccgcaacc 
25^961 r 22020 

acgagacgggacatgggcggctgcgacggccgcacgggtacaggcaggggagggcgccga 
sacll 

MSPAIII Aflll 
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3SCDSI !l smi: amrl 

: I I i , 

»Sffcggccccccgcggccgggcccccacgc?ccccaagaccaaggaaaccccaccaccgg 

*202X - 22C30 

cccgccgaaaggcgccgacccggaagcgcgcggaaccccgacccccciggggtagcgacc 

?f 111081 
BanIX i 
3spl28€l I 
Avail I 
I I I 

"'"•OS I ^^^^^^'^^^^^^^^^^^^'^^^^^^^^•^^^^^^^^ccccacaccccacrcagacggaaccc 

cgagcccgacgccgggaacaacgcggacgagaccgagacacgggacggacccacctcgga 

Ha el MspAlX 
Msci Earl iKaeX 
Eael I Drei IPvuII i 
I ! ! I { I 
c -caccccaaccacacccccaagaaggcggccaccacccicgactcrtrcgccagccggc 
22141 • ♦ ^ - - ♦ ♦ 22200 

aaacggagccggzgcggaaacccccccaccggcaacggaaaccgagaagacagccgaccg 

Haell 

3srDl s:co47ii: 1 Hindi 
I . ill 
<^*^^9<^**t9*ccgcccgcccacccccaacgagcccgaaaccaagcgcicagctgacgggg 
-2201 ^ ,™ ^. 22250 

gaccgccaccggcggacgaacgggggccgcccaaaccccaactcgcgagccaaccgcccc 

Drdll 

Acll Bran PflMI I Nhel 
I I It I 
^^^^ ^^srgctacaacgccgcccagtgcaacacgaccaaagaccggcccccggcacaaacgccag 
-226x ♦ 22320 

ccccaacgtcgcaacgggccacaccgcaccggcccccgaccaaggaccacgcccacgatc 

Nspl 
Tad! 

I t 

ctaaccacaacactggccaccagggcccccacaccccagagagctacaaggaccgcacgc 
22321 ♦ ^ ^ ^ , 22380 

gactgacgccgcaaccgacggccccgaagacacagggccccccgacgcccccggcgcaca 

BSP24I Bsp24I 
I ( 
acccctccttcagaaacctccagcccacgagccgccaggcggcggacgacaccaaacaca 

22381 22440 

cgaggaagaaacctctgaaggccgggcacccggcagcccaccacccaccacgacccacgc 

PflMI Bsbl 
I ! 
aggaccaccaacaggcgggcaccccacaccaacacaacaaccccggacccgccggccacc 

22441 * ♦ 22500 

ccccgacggctgcccacccgcaggacgcggtcgcgccgccgagacccaaacaaccgacgg 

Hael 
scul 

. ■ -- • ' 
^^^cccccaccacgcgcgaaggacaggcctaccccgccaaccccccccacccgcccatag 
22501 ^ , 22560 

aacgggggcggcacgcgccccccgcccggacgggacgaccgaaggggacaggcgaacacc 

Bsiei 
Pvul 

Hincll SgCI Bpml 

I \ I 

gcaagaccgcagccgacagcaccacccagaaaaagtccczccgcgaccgcaccccccggc 

22561 22620 

cgccccggcgccaaccgcegcaacgggccccccccaaagaaacgccagcgcgggaaaccg 

Bscos: 

NCOl 
Scyl 

gcaccccaccccccagcaaccccatgzccacgggcgcacccacagacctgggccaaaacc 

22621 22680 

cgcagggcaagaggccaccgaaacacaggcacccgcgcgagcgcecggacccggccccgg 

BscOSt 
BamHI NcoI 

Bsaxi ECU * BscYI Scyl 

II I I 

ccctccacgccaaccccgcccacgcgccagacacgacccttgaggcggaccccacggacg 

22681 ♦ ♦ 22740 

aagagacgcggccgaggcgggcgcgcgatccgcaccgaaaaccccacccagggcacccgc 



BsiEi Figure 28GG 

Eagi I 
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Gdill 
3S1KKAI I 
BSP1286I I 

Tthlllll BsrFI I 

BanlX I BseSI I I 

BSP1286I I Tthllll ApaLI 1 I I 

II I I I I I 

agcccacccccccccacgccccgcctgaagcctttgacgcggcccgcgcgcaccggccgc 

22741 - ♦ * ♦ 22800 

ccgggcgggaagaaacacaaaacaaaccccagaaaccgcaccaggcacacgcggccggcg 

BsiSI 
3srFH 

BsaHI NgoAIVI 
Sacll I Eael I I 

MspAlII 1 BspMl eagi II 

BSCDSI I I I Fspl I Gdill I I 

(ill II III 

accgcggcgccatcgaaaccgcgcacccgcgcacgccciccccggccggcaacgccacaa 

22801 ♦ ♦ 22860 

cggcgccgcagcagccccggcacacggacgcgcgcgggaagagccggccgccgcggcgcc 

aani: 

MspAlX 65P1286: 
PvuII BstDSr I 
Bpmll Ncol ! 
tChlllllll SCyl I AlwNI 

III t I 1 

cacaaagaagcaagcaacaccaacaacagccgccgccacgggccccagcgagcaggaacc 

22861 ♦ • ♦ ♦ ♦ ♦ 22920 

gcactcctccgccegccgcagccgccgccgacggcggcacccgaggccacccgceercga 

3spl286I 

BgXII Bngi I 

BSCYI Bs«Sl I 

RleAI Banll i Eeo47lII 

1 n I I 

gaaagccaccgccaaagaccccggccgcgggccacaccccccgggcacccacgacaagcg 

22921 — -T ♦ — ♦ — ♦ 22980 

cccccggcaacagcccccagaaccaacacccggcacaaaaaacccgcggacaccgcccgc 

Nrul 
BsiEZ I 
BslSI I \ 
BsrFIl I I 
Eael II II 
EagI t I (I 

Haell Gdill II II 

I I II I I 

ccccccaggccctgcccccccacacaageccgcccgcgccacagccaacacggccggccg 

22981 ♦ ♦ ♦ — ♦ ♦ 23040 

gaaaggcccgaaacaaagaggcgcgcccgagcggacgcggcaccagccacgccggccagc 

Bmrl Hael UbaLX Nspl 

I til 
cgagaccgggggcgcacaccggacggeccccgcccggaacccgcacccaaaaacacgcca 

23041 * ♦ * . ♦ * 23100 

gccecgacccccgcacgcgacccaccggaaacggaccccgggcgcgagcccccgcacgac 

Ban IX SmlZ 
Bspl286I Bce83I BspMI 1 TChlllll Hin4I 

11 II II 

cccccccgagccccctggcccccccgaccagcgacccaagcaggctcaccagcccgagca 

23101 ♦ ♦ ♦ * * 23160- 

ggagaaacccgggaaaccgaaaagaccggccgccgagcccgcccaaacggccaaacccac 

BsrDX MspAlX 
HaelZ BslEX I 

I II 
cgagccacccccgcgccgcagcgccaccgcccccccccccgaecgccgcacaacgccgga 

23161 ♦ * - ♦ ♦ 23220 

gcccagcgaggacgcggcaccgcggcaacgaagaagggggccggcgacacaccgcgaccc 

ApaX 
Ban XX 
BSP1286X 

BmgX I BslEX 
BseSX I EaeX I 
ECOO109X I I EagX I 
TaqXX I I GdiZX I 
111 II 

aaagcccacccaaagcgcacaggggcccaacceggecgcctgcggaccactecgccgcac 

23221 ♦ ♦ ♦ 23280 

ccccaggcgggccccgcacgcccecgggccgagccggcggacacccgacaagacgaegca 



BSCOSX 

NcoX 

Mspl Scyl Bael 

I * I I 

gctcccccacgcccccgccaaccggceceaaacccccacggaccacaaccceaccacgaa 



Figure 28KH 
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23281 . . 

caaagaggegegaaaacggecsacc5g;VcVc;;;;;;;;;;;;;;:;;;;;"t;- : "3*0 

Kpnl 

Bani I aaer B3p24I 

23 341 -"l"ll\*;=55J'5A^^^^^ 

g5aacaacggccccacggVc;VaVgVa;VaVcVgVc;;;;;;;;;:;:;;;;:;;;^ 23400 
Bsp24x I Aceli: Hin4II Bpml BplI 

agcgccggecccigccgaVaVg;Vg;;wVc«Vg;;;;;;i;;;;;;;;;;;;;c^ "46o 

Nspl 

Fspl HaoT' Afllll I 

•^1 Hael. BspLOIlI i Tad 

23461 

gccacgcwceaatccccgVgVc'gVaglVaaac'agVg^^^^^^^^ 

BsrGI 
Taci Aval 

23S21 

acgaectetgcgaaagcVaVcVcVgVcVacVaU;;;;;;;;;;;;;;;;;;;;;;;^ 

Oral mil 

23S81 A\l"i"=/Jl^l^^^^^^ 

atgggggcggg«acggcagVcVcgVc*aUVcVe7agrtUc«;;;;;^^^ 

.^5- Aflirri 85P1286I Bael 

23641 

caegcggg^ccgccceVgVgVaVeVcVaVglVcrcUVew^waUUrcc^ 23700 



23701 

gcgccggtaggcgccgtegagccacVrcVaU'glgagVcVtccgicg^^^ 



23760 



EcoRV 
Haell I 

; I EMO109I sell*""" ! 



23820 



880 



23761 All®A\"^"55J^^^^^ 

gegcaaatcgcecagccegcgVcVaVaVaVcVcVaVcVecUcsccggiw^ 

Mfliel 
BslKKAII 
Bspl286Ii 
BseSI I I 
Haell ApaLI I i i 

23821 

cgegcccaacgccacgegeccVaVeVeVgcgaVcUVeaUagM^^ " 

u-_. ECOO109I 

E,ri*: BSCVI Sse8«47l 

, I Hln4I I 88P0I I BpulOI 

23881 -^Alll'?A'A\7J?J^^^^^^ 

cgaccggccgcgcgagaaeaVc«VeVgVcUgVwVg*g««Mi;;c^^^ 

BSP1286I _ . « 

Hincii . a^^J I Figure 2 811 

23941 

ccgcccgccccagtcgaaVcVaVcgVc'gVaw^cVtUeVcgig'e^^ 
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3spl286I 
3mg r I 
Bsesi I 

Hinai Msll Bplt ascEir HaiEll I Bciv: 

II I'll ! 

gctgcacccgcaccgtagcggcaccaaaaggcsaccgcgcccgg-C'gggcgiLaggaca 

24001 * > - 24060 

caacgcgagcgcggcaccaccgcagccctccacczgcacgggccagacccgcaaccccac 

AlwNI 3pulOI 
Haell ZC057II 
i 1 t 

cagcgcccgcataaaagccccgacccgcccaaaarccacccgaacccctgcgcccccaga 

24061 -- 24120 

gccgcggacgcacccccggaaccagacgaactwtcggtggacccggaaacgcggaagccc 

Msiel 

3siHKAII 
BSP1286I1 

Eael agll BseSI I i 

Nspl BSCAPI Gdill SfiX ApaL: I It 

I I I I till 

gaagaacacgccgcaagactcgccggaaaaccgawcggcccgacaggccgcgccg-gcac 

24121 - ^ 24180 

ccccccgcacggcgccccgaacggccccctgaccaaccggcccgcccggcgcagcacgcg 

Bglll Bsawi 
BstYl BsrFl 
Bsgl Bsbl I PlnAI 
111 I 
gcagcaccctgcgccggcgccggagacccgcaccacaccccggccccaccggtcccccac 

24181 — — — - - 24240 

cgccgcggaacgcagccacaacccccagacgcggzgcaaagccggggcggccaagaagcg 

SC057I BssHII 
Kael I BssHXI I 

II II 
gaccccggccccgccagaccgcccccccagcgcgcgccgcccgcccccgcccgccacacc 

24241 ♦ ♦ ♦ ♦ ♦ * 24300 

ccagaaccggaacgacccgacgaggaagccgcgcgcgacgggcaaaagcgagcagcgcag 

BslHKAI 
BSP1286I 

BsaAX I Aflll 
Pmll I Acci Smll 

II II 
cactccaaccacgcgccccccacccaccacaacgcccccgcgcagacacccaag^ccgcc 

24301 M * - ♦ 24360 

gcaaagccagcgcacgaggaacaaacagcaccacgaaggcacacccgcgaactcgagcgg 

Banll 
BSP1286I 
BssSIl 
Tchlllir I 
Bsgl 1 I 

MspAlI SscOSIi i I 

I Hit: 
cccgaccccagcgcagcggcgcagccacaacgcgcagcccgcgggcccgcgacgccLgca 

24361 ^. - 24420 

aagccagagccgcgccgccacgtcggegccgcgcgccgggcacccgagcaccacgaacac 

Pstil 

Psci Sbfl 
BSCEII BspMI Sfcl i Sfcl I 

I III It 

ggccacccecgcaaacgaccgcaggcacgcccgcaggaaccgccccaccaccgccacaaa 

24421 ♦ — — ♦ - - 24480 

ccagcggagacgcLcgctgacgcccacgcggacgcccccagcggggcagcagcagcgccc 

aslHKAI 
Bspl286l 
SacII 
MspAlI I 
MspAlI 3SC0SI ! I 
PvuII BseRII II 
I I I : I 

ggccctgccgctggtgaaggccagccgcaacccgcggcgccccccg^ccagccaggcccc 

24481 * * ^ 24S40 

ccagaacaacgaccacccccagccgacgccgggcgccacgaggagcaagccggcccagaa 

Eael I Figure 2 8JJ 

Eagi I ^ 
Mill I 
I I 

gcacacggccgccagagctcccactcggccaggcagcagcccgaagcccgcctcLagacc 

24S41 ♦ ♦ ♦ ♦ 24600 

cgcacgccggcggcctcgaaggcgaaccagcccgccaccaaaccccaagcggaaacccag 

BssHZI 
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BsaAI BSSHII 1 

BsaBI ?mll BssKZ I I I 

II 111 
gccacccacgiggcacccgcccaccagcgcgcgcgcagcccccaigccctcctcccacgc 

24601 ♦ 24660 

caacaggcgcaccacgaacaggcagccgcgcgcgcgccggaggtacgggaagagggcgcg 

BsiEI 
pvur MSpAlI 

I 1 

agacacgaccggcacacccagcgggcccaccaccgcaaccccaccccccgccicgccggg 

24661 - ♦ • 24720 

cccgcgcragccgcgcgagccgcccaagcagcggcaccaaagcgaaaggcgaagcgaccc 

Banll Earl BmrZ 
BSPI286I Sapl Earl Bbsll 
III II 

ccccccctcccccccccgcgcccgcacaccacgcgccaccgggicgcricracccagccg 

24721 ♦ . * 24780 

gagaaggagaaggagaacgcaggcgcacggcgcgcggcgacccagcagaagcaagccggc 

Bsawi 
asrFl 
PinAI 

Tchllliz sgrAZ 

I, I 
ccacaccaLocgctcaccccctccgccacgcccgaccaacaccggrgggc'gccgaaacc 

24781 * 24840 

ggcgcgacacgcgaacggaggaaacggcacgaaccaaccgcggccacccaacgacctcgg 

KaelZ 
I 

caccacczgcagcgccacacccccccccccccccccgccgcccacgaccacccccggcga 

24841 ♦ * • ♦ 24900 

gcggcaaacaccgcggcgcagaagagaaagaaggagcgacaggcgccaacggagaccacc 

BsrDI 
Kaell 

Aval Hscll 
HaelX Haell EaeX t I 

I I lit 

cggcgggcgcccgggcccgggagaagggcgeccccccccccccccgggcgcaacggccaa 

24901 ♦ * — * 24960 

acegcccgcgagcccgaaccccccccccgcgaagaaaaagaagaacccgcgccaccggcc 

Sad I 

MspAlI 
BSCOSZ 
TaQlI I 
£ael I I 

Ecll Cdilll I M Ban! Bbsl 

I I I 1 I I I I 

acccgccgccgaggccgacggccgcgggccgggcgcgcgcggcaccag?;cgcccc?cga 

24961 ♦ ♦ * ♦ 25020 

caggcggcggccccagccaccggcgcccgacccacacgcgccgcggccgcgcagaacacc 

SmaZ 

. . . Aval 

HaeXZ 
3saHZ t 
Nan I 

Hin4Z BpIZ aanZI I 

1 I III 

cgagccccccccgcccccggaccegacacgccgccccacccgccccctcgggggcgcccg 

25021 • ♦ ♦ 25080 

acccagaaggagcaggagccrgagccacgcggcggagcaggcgaaaaaacscccgcgggc 

3SC0SZ 

Bsax: I 
Hln4Ii I 
AhdX I I i 
AfllZIt i; I 
KaeZVl II NcoZ AacXI 
Hin4Zl M scyz BsaKX i 

I I II I 11 
gggaggcggcggcgacggggacggggacgacacg-ccccccacggccgggggacgccgcgc 

25081 " * - 25140 

ccccccgccgccgccgcccccgcccccgccgcgcaggaggcaccaaccccccgcagcgcg 

-ici Figure 28KK 

AvaX BseRI EarZ EaeZ I 

I 1 III 

cgcaccgcgcecgcgcccgggggcggccccgcgccgcccccccLCCcgaccggccacccc 

25141 • * - ♦ 25200 

gcgcggcgcaggcgcgagcccccaeeaaagcgcgaegaggagaagggccgaccggcaaag 

SfcX 

ccccccccacaggcagaaaaagaccacggagccagccgagaagaaggacagcccaaccgc 
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25201 - — 25260 

gaagaggacacccgccttcccccagcaccccagccagcccccccccccgccgcaccggca 

cccccccgagcccgccaccaccgcccccaccgacgccgccaacgcgcccaccacctcccc 

25261 * M ♦ 25320 

ggggagacccaagcggcggcggcggaggcggccacggcggccgcgcggacggcggaaggg 

ECOO109I 
3cce3I t 
SseRI f 

Ban! SmXI BseRI I Psp5i: 

II 111 
cgrcgaggcacccccgcccgaggaggaagaagcgaccaccgagcaggacecaggccccgc 

25321 ♦ ♦ ♦ 25380 

gcagccccgtgggggcgaacccccccccecccaccaacagcccgccccgggcccaaaaca 

BsrSX 

BbsZ Hin4I I BpII 
III f 
aagcgaagacgacgaggaccgcccagcaccaacagaggacaaaaagcaagaccaggacaa 

25381 * ♦ ♦ ♦ * ♦ 25440 

cccgcccccgccgcccccggcgagccacggccgtcccccaccccccgccccggccccgcc 

RleAI BsmBI 

t I 

cgcagaggcaaacgaggaacaagccgggcggggggacgaaaggcacggcgaccacccaga 

25441 ♦ — ♦ * ♦ 25500 

gcgcccccgcccgccccccgcccagcccgcccsrcccgctccccgcaccgccgatggaccc 

Haell 

Psci I Afiril 
BsaXI SfcX ) i Mlul 

I III : 

cgcgggagaegacgcgccgccgaagcacccgcagegccagcgcgccaccacccgcgacgc 

25501 ♦ ♦ ♦ ^ 25560 

acaceccccgecgcacgacaaccccgcagacgcegcggccacgcggcaacagacgcigcg 

Bspl286I 
Bmgl I 

BseSX I peiiiosi 
I I I 
gccgcaagagcgcagcgacgcgccccccgccacagcggacgccagccccgcccacgaacg 

25561 ♦ ♦ ♦ ♦ ♦ 2S620 

eaacgcccccgcgcegccacacggggagcggcaccgeecacagccggaacggacgcccgc 

BanIX 
BSP1286I 
Nspl • I 
I I 

ccacccaccctcaccgegcgcaccececaaacgccaagaaaacggcacacgcgagcccaa 

25621 ♦ ♦ * * 25680 

ggcggacaagagcggegcgcacggggggcccgcggcceccttgccgcgcacgeccgggcc 

TChllli: 
I 

cccgcgccccaacccecaccccgcacccgccgcgccagaggcgcccgccacccaccacac 

25681 - ♦ - - ♦ 25740 

gggcgcggagccgaagacggggcataaacggcacggcccccacgaacggcggacagcgca 

- - ■ BsrBI 

I 

ccccccccaaaaccgcaagacacccccaccccgccgcgccaaccgcagecgagcggaeaa 

25741 ♦ ♦ ♦ . ♦ ♦ 25800 

gaaaaaggccccgacgccecacggggacaggacggcacggccggcgccggcccgcccgcc 

Tthlllll 
KaeZ i 

MSPAII t I Hin4I 

Pvull I I HaeXX ScoRV ) 

III I 11 

gcagccggccccgcggcagggcgccgccacacccgacaccgccccgcccaacgaagcgcc 

25801 ♦ ♦ — ♦ 25860 

cgccgaccggaacgcegccccgcgacagcacggaccacagcggagcgagccgccccacgg 

BssHII 
I 

aaaaacccctgagggccccggacgcgacgagaagcgcgcggcaaacgccccgcaacagga 

25861 — ♦ *— ♦ 25920 

ccttcagaaacccccagaacetgcgccgccctccgcgcgccgcccgcgagacgccgtccc 

Avai'''"'^ Figure 28LL 

BsaXX SmlX I 

Bsbl XhoX I BssHXX 

I II I 

aaacagcgaaaacgaaagccaccccggagcgccggcggaacccgagggcgacaacgcgcg 

25921 ♦ ♦ ♦ ♦ 25980 

cccgccgcccccacccccagcgagaccccacaaecaeeccgagcccceaccgccgcgegc 

Bgll 
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BstEII Taqll 
t I 
^e«o, ccc*9Ccgcaccaaaacgcagcaccgaggccacccaccccgcccaccc55caCwLiact; 

25981 ^ - - 26040 

ggaccggcatgactctgcgccgtagctccagtgggcgaaacggacgggccgcgaaccgga 

Real 
Hsll 
Acelll 
BslHKAI 

HglEII Bspl2e6I 

scyl Real I II ?s?l AlwNl 

II III .11 

- , **=^^<=<=<=**99tcacgagcacagccacgagcgagctgaccgcgegecgw?cgcageccc;: 

260*1 ♦ ^ ♦ , 26100 

tggggggccccagcacccgtgccagtacccacccgaecageacgcggcacgcgccgggga 

Tthlllll 

Apol Bpml EC00109I iBseRI 
t I I I : 
^^*^^^^^*^^*^***^^^^^**^**^****^*9^^^*^95^^^^*^^*^5^*^^^Wcgacga 
26101 * ♦ ^ 26160 

cccccecccaegcccaaacgctcccgcccgtcccctcccggacgggcgccaaccgccgcr 
^B«sHII 

Nhel I BseRl 



I I 



I 



_ . _ , 5<?*9ecagGgcgccggct!:caaacgcgcgagcctgccgact.cggaggagcgacgcaaacc 
26161 4. ^ - ♦ 26220 

egccgaccgcgcgaccgaagcccgcgcgcccggacggetgaaccccctcgccgcgcctga 

BslHKAI MSPAII 
Eael Bspl286l BsaXI Nspl I 
Gdill Bcsl IBscOSI Smlll SphI I Bee93I 
( 1 i i II til 
**^5^^^9gccgcagcgcccgccaccgcggagcccgagcgcacgcageggciccccgccga 
26221 ♦ ^ - 26280 

ccaccaecggcgccacgagcaacggcaecccgaactcacgcacgccgccaagaaacgacc 

BaaAl 
SnaBI 

BsrOI SunI 
I I 
cccggagacgcagcgcaagccagaggaaacattgcaccacacccctcgacagggccacgc 

26281 — — ♦ 26340 

gggcccccacgtcgcgcccgacccccttcgcaacgcgacgcggaaagccgccccgacgca 

Banll 
BslKXAI Mmol 
Hael Bglll Bspl2861 BaaXI I Scyl 

scul BscYI saci SexAI I IBsal I Apol 

> ( I I I I I I I 

acgccaggcccgcaagacecceaacgcggagctccgcaacccggteccccaccctggaac 

26341 - - ♦ — ♦ ♦ 26400 

tgeggcecggaegtcetagaggccgcaceccgagacgctggaccagaggacggaacctca 

Bce83I Ascl 
SCyX BsCAPIl Sffill BssHII 

... I 11 I I 
^^t^eacgaaaacegceccgggcaaaaegcgcctcaccecacgcccaagggcgaggcgcg 
26401 ♦ ♦ ^ 26460 

aaacgcgcccctggcggaaecegcctcgcacgaagcaaggtgcgagLCcccgctccgcgc 

BstOSI 

NCOI 

Styl 

^ . - - - Eael I 

TChlXlI Gdill I 

' I I 

<^'^9C9actacgcccgegaccgcgcccacccatitctacgctaeacccggcagacggecac 

26461 ♦ ♦ 26S20 

ggcgccgacgcaggcgccgacgcaaacgaataaagacacgacgcggaccgcctgceggca 

AlwNI 
BstAPI 

Bce83l BseRI PscI I 

Tchlllll BtsI I Acelli smll i sfcl I i 
I 11 t It 111 

.^.^ 999cgcccggcageagcgeccggaggagcgcaaeeccaaggagccgcagaaaccgccaaa 

26521 ♦ , , 26S80 

ceegcaaaccgcegccacgaaceececcaegccggagceceecgacgceccegacgaccc 

Ea«I , 

Gdiii Figure 28MM 



ECOO109I Hael I I 

PapSII Eco47tii I I 

Sse8647l BsaXIl I I 

t lilt 



Eeil 
I 



gcaaaacctgaaggacccacggacggcccceaacgagcgccccgcggccgcgcacccggc 
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26S81 



26641 



26701 



* ^ . . ^ ^ 26640 

cgccccgaaccccccggacacctgccggaa5CtgctC5cgraggca==5gcgcgtggaccg 

ScoNZ AlwNI 
I I 

ggacaccactccccccgaacgcctgcctaaaaccctgcaacagggccrgccagacctcac 

^ ^ ^ ^ ^ ^ 26700 

cccgcagcaaaaggggcccgcggacgaacctigggacgccgccccagacggtccgaagcg 

BpulOI 
Kaell 

Nspl EC047III I 

1 1 I 

cagccaaagcacgccgcagaacitcaggaaccccaccccagagcgcicaggaatcctgcc 



26761 



gccagccccgcacaacgtctcgaaaccctcgaaacaggaccccgcgagcccccagaacgg 

BslHKAI 
BSP1286I 

BseSI I BSP1286I 

BspMI I Bnigl I Ecil 

Aarl ApaLI I I BseSl 1 Bsmi i 

* > I i II II 
cgjrcacctgcngcgcaccccccagcgacccigcgcccaccaagcaccgcgaacgccctcc 

gcggtggacgacacgcgaaggatcgccgaaacacgggcaacccacggcgcccacgggagg 



26760 



26820 



Nhel 
?sci I 

BCSI SfCl f I 

I III 

^"9ccccggggccaccgccaccctctgcagccagccaaccaccccgcccaccaccctga 
26821 " . . . ^ ^ 26880 

cggcgaaaccccggcgacgacggaagacgccgaccggccgacggaacggacggtgagacc 

BsrBI BscAPI 
Bbsl I Acci Bpml 1 
til II 
•> oo 1 ^*"*^S^«**^*cgcgagcggcgacggcccaccggag^gccaccgtcgctgcaacccacg 
26881 ♦ — — ♦ — ♦ ♦ V . ♦ 26940 

gcaccacctcccgcacccgccaccgccagacgaccccacagtgacagcgacgccggacac 

, MspAlI Bani 

a«rBX Pvuii Acelll i 

< 1 It 

^ caccccgcaccgccccccggcccgcaacccgcagccgcccaacgaaagtcaaaccaccgg 

2^^^^ * * ^ 27000 

gcggggcgtggcgagggaccaaacgccaagcgccgacgaaccgcccicagcccaacagcc 

Sacll 

EcoOl09r MspAlIl 
PspSlI BscDSI II 

SanOI Ahdl ill 

Psci I Haeivi III 

Kpnl SCci 1 I Hin4II I II 

( III It I II 

cacccccgagccgcagggccccccgcccgacgaaaagcccgcggccccggggccgaaact 



27001 



acggaaacccgacgtcccagggagcggaccgccccccaggcgccgaggccccaaccctga 



27060 



Aacil 

BsaHI I Apol 8SU3 6I 
II I I 
c^<=^<=^cggggccgcggacgccggcccacccccgcaaacccgcacccgaggactaccacgc 
27061 ♦ * ^ , 27120 

gcgaggceecgacacccgcagccgaacggaagcgcccaaacacggacccccgacggcgcg 

PflllOBI 
Bsssi UbaLZ I Bbsl 

I I I . I 

ccacgagactaggccccacgaagaccaaccccgcccgccaaatgcggagcccaccgcctg 
27121 ♦ ♦ * ♦ ^ 27180 

ggcgccccaacccaagacgcttccggctagggcgggcggcccacgcctcgaacggcggac 

Muni 
Hael I 

M3CI I 
Eaei I \ 
Bscxi I I I 

till* 

^5^c*ctacccagggccacacccctggccaaccgcaagccaccaacaaagcccgccaaga 
27181 ♦ — ^ 27240 

gcagcaaLgggccccggcgcaagaaccggccaacgcceggcagccgicccgggcggcccc 

Banir ^^^^^ 

Ahdl I BsiHKAi Figure 28NN 

o#i,,ftOT HaelVli BSP1286I ^ 

Pf 111081 Hln4Ilt Sad 

( ill 1 

gcccccgccacgaaagggacggggggcccacccggacceccagcccggcgaggagcccaa 
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27241 2730C 

caaagacgacgctctccccgccccccaaacgaacci^ggssccaggccgccccccgagtc 

Apal 
Banll 
Bspl286I 
Bmgl 
BseSI 
ECOO109II 
SacII II 
MspAlIl I I 

BseRI BstDS: t I it 

1 I II II 

cccaacccccccgccgccgcagccccaccagcagcagccgcgggcccccgcsccccagga 

27301 * ♦ » • 27360 

gggccaggggggcggcggcgccgggacagccgccgccggcgcccgggaacgaagggcccc 

MspAlI 
PvuII 
Pstll 

Taqll I j BseRI 
Bani sfcl I li BscDSI Taqll Smrl I 

« I I I I 1 111 

cggcacccaaaaagaagccgcagccgccgccgccacccacggacgaggaggaacaccggg 

27361 ♦ ♦ ^ 27420 

accgtgggcccccccccgacgccgacggeggcggcgggcgcccgcccccccctacgaccc 

SseRZ 
BseRI I Bbsl 
BseRI BseRI I I Bmrlt 

I I I I It 

acagccaggcagaggaggccccggacgaggaggaggaggacacgacggaagaccgggaga 

27421 ♦ ♦ ♦ ♦ ♦ 27480 

cgccagcccgcccccLccaaaacctgccccccctccccccgcaccaccccccgacccccc 

Hindu I 

EcoNI I earl Taqll Beg I 

III II 
gcccagacgaggaagcccccgaggccgaagaggcgtcagacgaaacaccgccacccccgg 

27481 ♦ ♦ ♦ 27540 

cggacccgcccecccgaaggccceagcccccccacagcccgccccgcggcagcgggagcc 

Haell 
BsaHI I 
Narl I 
BanI t I 
BsrFI I I I BsaWI 
BsmI NgoAlv I I I BsrFI 

BslEI I SgrAI I I I Bcgl PlnAI BseRI 

II I II I I t I 
ccgcaccccccccgccggcgccccagaaaccggcaaccggctccagcacggccacaacct 

27541 27600 

agcgcaaggggagcggccgcggggcecccagccgccggccaaggccgcaccgacgccgga 

BsrFI 
NgoAIv 
Haell 
BsaHI 
Bsu36l * r- 
BsrBI 1 Nan 
BsaXI 1 I BanI i i I Bcsl 

III Mil t 
ccgccccccaggcgccgccggcaccgcccgcccgccgacccaaccgcagacgggacacca 

27601 ♦ * • 27660 

ggcgaggagcccgcggcggccgcgacgggcaagcggctgggLcggcacccaccccgcggc 

BsrFI 
Hgieill 

Ordll I I Tthllli: 
III I 
ccggaaccagggccggcaagcccaagcagccgccgccgccagcccaagagcaacaacagc 

27661 ♦ 27720 

gaccccggccccggccacccaggcccgccggcggcggcaaccgggctctcgccgccgccg 

Tchlllll 
TtHlllII 
Bspl286l I 
Haell Bmgl I { 

scyl Bsrai BseSI I I I RleAl 

i t ,1111 I 

gccaaggctaccgctcacggcgcgggcacaagaacgccacagccgctcgcccgcaagacc 

27721 ♦ ♦ ♦ ♦ .27780 

cggccccgacggcgagcaccgcgcccgcgcccccgcggcaccaacgaacgaacgccccga 

^^^Kaei Figure 2800 

Oram I ^ 

I I 

gcgggggcaacaccccccccgcccgccgctcccccctccaccaccacggcgcggcccccc 

27781 ♦ • 27840 

cacccccgccgcagaggaagcgggcggcgaaagaagagacggcagcgccgcaccggaagg 
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27901 



Sfcl BsrFI 
8sgl I SgrAI MspAil 
II : I 
^^cc^rcaacaccccgcaccaccaccgccacctctacagcccataccgcaccggcggcagcg 
27841 ♦ ♦ ^ 27900 

gggcaccgtaggacgcaacgacggcagcagacacgccgggtacgacgiggccgccgccgc 
eaeX 

Gdlll BsiEI 
MspAlI MspAlII BsaWII 

I . II II 

gcagcggcagcaacagcagcggccacacagaagcaaaggcgaccggatagcaagaccccg 

^ ^ ^ ^..^ ^ 27960 

cgccgccgccgctgccgccgccggcgcgcctccgcccccgccggcccaccgctccgagac 

BsaKI 
Narl 
Ban! I 
BseRIi I 
BseRI 1 I I 
HaeXI 1 III 

MspAlI EC047III I I I i I 

cgccccgggccccccaggtgccgccgccgccgccgccccccccctcgcgacgcagaccgc 



27961 



28020 



KaelX 

I 

-^a^-y n ^^"*^^**^^=9taccgacccgcgagcccagaaacaggaccccccccaccccgcatgcc 

2^°^^ ♦ * ♦ ♦ ♦ 28080 

gggctgcccgggcacagccgggcgcccgaatcctcgccccaaaaagggcgagacacacga 

Acelll Bsai 
t I 
'iofsa^, *^*^^^*^**^*9agcaggggccaagaacaagagccgaaaacaaaaaacaggcccccgcga 

28081 ---- ♦ - — * ♦ 28140 

cacaaagtcgccccgtccccggctcccgcccccgacccccaccttccgcccagagacgcc 
MspAXI 

Pvull ECOS7I 
I I 
ccccccacccgcagccgcccgcaccacaaaagcgaagaccagccccggcgcacgccggaa 

281*1 ♦ * * ♦ 28200 

agggagcgggcgccgacggacacagcgccctcgcccccagccgaagccgcgcgcgaccct 

^^^^^ 

Bt>s* Earl BssHll smll Spel 
1 I I I 1 
^^^^cssassccccccccagtaaataccgcgcgccgacccccaaggaccagccccgcgcc 
28201 ♦ - - * 28260 

ccgcgcccccgagagaagccacccacgacgcgcgaccgagaacccccgatcaaagcgcgg 

Haell 
BsaKI 

Eael Marl 
- Gdill Banll 

Apol Bpml BsaXI MspAlII Bsp24I II 

I I I M III. 

no-»^i ^^^^^^^***^^^aagcgcgaaaaccacgccacccccagcggccacacccggcgccagca 

28261 — ♦ ♦ ♦ 28320 

gaaagagcccaaactcgcgctcccgacgcagcagaggtcgccggcgcgggccgcggccgc 

Nspl 

Bsp24I Afllll I 

AlwNI Haell | Apol BspLUllI I 

I II I ■ i I 

'^^^^^^^^^^^^^^^ccaccacgagcaaggaaacccccacgccccacacgcggagccaccag 

28321 — - ♦ , ♦ 28380 

ggacagcagccgcggtaacactcgcccccccaagggcgcgggacgcacaccccaacggcc 

Acelll 8pnl 
' I 
o -» o , ccacaaa tgggacctgcggccggagccgcccaagaccacccaacccgaacaaactacacg 
28381 ♦ ♦ * • ♦ ^ 28440 

ggcgctcaccctgaacgccgaccccgacgggccccgacgagccgggcccacctgacgcac 

Hindi 
RleAI 

ECOO109I smal I 

PspSII ^ Aval I I 

SanDI EcoRV i i i 

1 I I I I 



^ - ^ , , ^^c^cgggaccccacatgacaccccgggccaacggaacccgcgcccaccgaaaccgaacc 

28441 ♦ ♦ — ♦ 28S00 

ccgcgccccggggcgcaccacagggcccagttgccccaggcgcgggcggccccggcccaa 

-^.^ ^^cccggaacaggcggccaccaccaccacacctcgcaacaaccccaacccccgcagccgg 
28501 ^ ^ ^ 28560 



Apoi Figure 28PP 

ECORI 

t 



SUBSTITUTE SHEET (RULE 26) 



wo 99/02658 PCT/US98/14239 

76/85 

gagaaecceatcc9ccgaca,co3c«ccotoga«caeeaetg««accaQg35c-ccaacc 

BsrBI SsmBI BsaHI 

28561 

gagcgacgggaccacatagccVc VcVaVaVcVagwsVcgaciccicg^ " 
28S21 ^f^^^^?^^^^^y^^^^?^^J^^}''^^^'=^^^^9cgciget.zgcgggcg<;cttzcgtciz 

cgggcccggccccaagcctaceVaVtVaVcVcVcVc^cVgUcgVccgVcgU^^ 

Smal 
Srfl 
Aval I 

Drain BslEI i I 
I I 1 I 

28681 A^A'A^f-^f^V^l^^^^^^ 



ccccacgccagcgggcccgccccatactsagcggacc5cca<3cctcccgccccacaagc^ 



28740 



Ban IX 

BslHKAI BsaWI 

BseRI BSP1286I BspEI 

Acelir 1 saci Bsal 1 

II * I II 

28741 JUl^At^fA*;^^^^^ 

gagctgccgcccagccacccgaggagcgaaccagaggcaggcctgw 

BslEI 
Eael 
Eagi 
Gdiii 
Haelll 
BsrFI! I 
NgoAIVI f 
BsaHI I I I 
Narl I I I 
BanI i Ml 
II I M 



Tchllll 
Psci I 
Sfcl I I 

28801 A^J?JL*l'^^^^^ 

ccgccgcggccggcaggaagcaagcgcggagcagcccgttaggaccgaglcgtctggigc 

Bp II Bpml Taqll 

28861 A^JIIL^AI*^^^^^^ 

aggagacccggcgcgagacccccgcaaccccgagacgccaaacaaccccccaaacacggc 

ECC3O109I 

a^APT^'^^f . PspSlI Eael ssawi 

Aval I Gdill BspEI 
' ' II I I 

ccggcctactccaacccccccccgggaccccccggccaccacccggaccaatccactccc 



28B60 



28920 



28921 



28980 



29040 



agccaga cgaaa ccggggaagagccc egg agggc egg cga caggccUg cUaa taagga 

Eeil PC1II08I 
t I 

28981 A*^A"5A^/A^5^^^^^^ 

ccgaaactgcgccaccccccgagccgcctgccgacgccgacccacaactcacctccM 

SexAI 
I 

29041 .^A^fA^^A^fJJ^^^^^^ 

cccgtcgacgcggaccccgcggaccaggcgacagcggcggcgcccacgaaacgggcgccg ^^^^^ 

Apal 
Ban 1 1 
BSP1286I 
BngI I 
BsaSI ( 



Aval ECOO109I I \ BsaHI 



29160 



t II. , 

29101 A"55JL^A'^^^^^^^ 

aggccacccaaaacgacgaaacctaacgggcccccagcacagcccccgggccgcgcgccg 

Haeii Figure 28QQ 

29161 

caggccgaacggcgggccccccccgaacgggcaccggaccaagcccccaaacgggccgcg 
ECOOI09X 
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292 



1 1 



PspSXI 
SsrBI SanOX Alot 

I I I 

cccccgccagccgagcgggacaggggaccccgtsziCLcaccgcgacctgcaaccgccc: 

ggggacgaccaacccgccccgccccccgggacacaagagcgacaccaaacgczgacagga 



29280 



Bglll 

Bglll BSCYI 
scyr BscYl Paci i 
I I t I 
***^^^^^^*^^*^*'^^^*^*^*^'^^'"*'^*9ac=iiaccccctCwaaccaacaaaaaaaa 
-^281 - - ♦ 29340 

ccggaacccaacgcagccccagAATTAATTccagaacaagggaaaccgaczacccictct 

Ape I BspGZ 

I I 

acaacaaagcaccactcacrtaaaaccagccagcaaacttcrgrccacz-XaCwCagca? 

-3^41 - - -- 29400 

caccactrcgcagcgaacgaaccccagccaatcgzccaaagacaggccaaacaagccgtt 

BseRI Acelll 
» I 
cacccccLwgccctccccccagccccggtaccgcagcccccccctggccgraaaccccct 

2^401 — - 29460 

gcggaggaacgggaggagggccgagaccacaacgicgaaggaggaccgacgcccgaaaga 

Xcml 
I 

ccacaacctaaatggaacgccagcccccccccgcccccgcccacccgcacccaccaccci 

29461 --- ♦ - - 29520 

ggcgccagacctaccccacagccaaaggaggacaaggacaggcaggcgcgggcgacagaa 

Taqll BssHII Eco57l 
I I I 
cacgccgccgcagatgaagcgcgcaagaccgccrgaagacacccccaaccccgcgcaccc 
29521 • - ^ ^ 29580 

gcacaacaacgcccaccccgcgcgccccggcagacccccacggaagccggggcacacagg 

asaxz 

Bsawt 1 
BciVI BsrFI I 
Ndel I PlnAZ I BseRI Mmel 

I I I 1 I I 

acacgacacggaaaccggccccccaaccgcgcccccccccacccccccctccgcaccccc 

29581 - 29640 

cacaccgcgcccccggccaggaggccgacacggaaaagaacgaggagggaaacacagggg 

Bel VI 

caacgggccccaagagagcccccccggggcaccc-ccccgcgcccatccgaacccccagc 

29641 ♦ 29700 

gccacccaaagcccccccagggggaccccacgagagaaacgcggacaggcccggagacca 

Nspl SsrFI 
Tchlllll sphi BspGI NgoAIV 
It II 
^^^^ tacccccaacggcacgcccgcgcccaaaatgggcaacggcccctccccggacgaggccgg 
29701 ♦ ♦ ♦ . 29760 

acggaggccaccgcacgaacgcgagccccacccgzcgccggagagagacccgccccggcc 

Banir 
Bspi2e6: 

BsaXI Hin4I 3pll 
I II 
ca&ccccacc-ceccaaaacgcaaccaccgcgagcrcaccccccaaaaaaaccaagccaaa 
29761 ♦ ^ 29820 

gctggaacggagggccctacaccggcgacacccgggcggagagcccccccggtccagc-- 

Bsgl Tthlllll 

t I 

cacaaacccggaaacacccgcaccccccacagc-accccagaagccccaaczgcggcc^c 
29821 * • ♦ * . 29880 

gcacccggacccccacagacgcggggagcgccaacggagtccccgggactgacaccgac? 

KglEXI Bsbl ECOO109I 

I I I 

-.fta<» *=3ccgcacccccaacggccgcgggcaacacacccaccacgcaaccacaggccccgccaac 

29881 * — ^ .... 29940 

gcggcgcggagaccaccagcgeccgccgcgcgagcggcacgccagcgcccggggcgac^g 

BSIHKAI 

^llllV] ^^ti^ Figure 28RR 

ApaLI I I BsrOI Sty I \ Taqll 

III 1 t I I 

cgcgcacgactccaaacccageaccgceaeceaaggaccccccacagcgccagaaggaaa 

29941 ^ * 30000 

gcacgcgcigaggcccgaaccgcaacggcgggcccccggggagcgcsacagccccccczi 
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TC.Mlli: 

Nhel ECOO109I I BlsI 

III I 
gccagccccgcaaacaccaggcccccccaccaccaccgacagcagcaccctcaccaccac 

30001 - * 30060 

cgaccgggacgctcgcagtccgggggagcggtggcggccaccgccacgggaacgacagcg 

Banll 

XcmZ BspI2a6I 
1 I 
^^'^'^^^^^^'^^^^^^**^^*^-^^^*^^^5'*9*^"*^^^**^**^^5^**^^^^*"9*««ccac 

•^0061 - 30120 

acggagcgggggagaccgacgacggcgacca^cgaacccgcaaccgaacccccccgggca 

Ban 1 1 
B5PI286I Nspl 

I I 

ccatacacaaaacggaaaaccaggaccaaagcacggggccccctcgcacgcaacagacga 
30121 ♦ * ^ 30180 

aacatgzgccccaccccccgaccccgaccccacgccccgaggaaacgcacaccgcccgcc 

BspGl vspl 

cccaaacaccccgaccgcagcaaccggcccaggcgcgaccaccaacaacaccccctcgca 

30181 ♦ — * ♦ ^ 30240 

ggacccgcgaaaccggcaccgccgaccaggcecacaccgacaaccaccacgaaggaacgc 

Scyl Bpml 
I I 

* ^ ^d<^^AAA9C&Accggagccccgggccccgacccacaaggcaacacgcaacccaacgcagc 

30241 ♦ ^ 30300 

ccgaccccaacgaccccggaacccaaaaccaagcgccccgtcacacgccgaaccacaccg 

Hin4I BsaHI 
i I 

aggaggaccaaggaccgacccccaaaacagacgccccacac!:cgacgccagccacccgcc 

30301 , 30360 

cccccccgacccccaaccaagagccccgcccgcggaacacgaaecacaaccaacaggeaa 

ApaX 
Banll 
Bspl286X 
BmgZ I 
BseSI I 
BcoO109I( I 
£coO109Ili I 
Ml I 

cgacgcccaaaaccaaccaaacccaagaccaggacagggcccccccrccacaaacccagc 

30361 ♦ ♦ ♦ • * 30420 

accacgagccctggctgacccagaccccgaccccgccccgggagaaaaacacccgagccg 

Kael 

PflMI RleAl scul 

I I r 

ccacaacccggacactaaccacaacaaaggcccctacctgctcacagccccaaacaatcc 

30421 30480 

ggcgccgaacccacaactgacgccgcccccggaaacgaacaaacgccgaagctcgccaag 

Tchlllll 

Smll BpulOl scyl I 

Tcnillll) Hineix ( Bce83Xl I 
Hindlll II Hpax I Bcsl II i sfcX 
I H II I I I I 1 

caaaaagcccgaggccaacccaagcaccgccaaggggccgacgcccgacgccacagccac 

30481 * , ♦ 30540 

gtcccccgaaccccaaccggacccgcgacggttccccaactacaaaccgcgacgccggca 

vspl Apol Ordll 

I I I 

agccaccaacgcaggagacgggcccgaacccggcceacccaatgcaccaaacacaaaccc 

30541 - ♦ 30600 

ccggcaaccacgcsccccacccgaacccaaaccaagcggaccacgcggcccgcgcccagg 

BscDSX 
NCOI 

scyl 
Ka«X ) 

^ ^ M3CII TChlllll 

Tchlllll Eael II HaeX ApoX DrdXI lEcoNI 

I I II I I ill 

<=<^^c^**acaaaaaccggccacggcccagaacccgacccaaacaaggccacggcccccaa 

30601 - ♦ - - 30660 

ggagccccgcccccaaccggcaccggaccccaaaecaagcccgccccgacaccaaggacc 



Haei Bani Flgure 28SS 



-./x^^ ^ccaggaaccggcctcagccccgacagcacaggcgceaccacagcaggaaacaaaaacaa 

30661 ♦ * ♦ 30720 

cgacccccgaccggaaccaaaaccgcegcgcccacggcaacgceaccccccgcccccacc 
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ACCI 
SCCI I 

BsaXI Aceltl I I 

I III 

"^^^^^"^^^^^^^^^^^^♦••^••••^•♦••^•^••••■►^•^■►•••^^^^^•^^^ 30730 
accacccgaccgaaacacccggcgcggccgaggcagaggaccgacacccgacccacgccc 

Sfcl 

•»moi ^*^^^^^^^^^**^c^ac«^-tS9tcccaacaaaacgcggcagccaaacacctgccacagc 

~ " ~ ** ~ " ^ *** ^ ^ ^ ^ ~ ~ — — • ^ 30840 

ctctctacgacccgagtgaaaccagaaccgctccacaccgtcagcccacgaacgacgcca 

BslHKAl 

PflMI Bspl286l 

t 1 
lAo^ 1 ^^^^^^^^^^^ctgtcaaaggcagcccggccccaacacccggaacagcccaaagcgctca 
J0841 ^ ^ ^ 30900 

aagccaaaaccgacaacccccgccaaaccgaggccacagaccccgccaagccccacgagc 

BspGI 

Tnoo' ^*^^*^^*^**9*^t^9acgaaaacggagcgccactaaacaacccctccccggacccaga 
-u^^oa ♦ ♦ ♦ — ♦ ^ 30960 

agaacaacactctaaaccgcctccaccccacgacgacctgccaaggaaggacccgggccc 
Bscxi Bglll 

SSPII BSCYI Mme! Eco57l 

<< III 

•»Aoci *'^*^^^®**^^^^*9***tOTa9accccaccgaaggcacagcccacacaaacgccgccgg 
30961 ♦ ♦ ^ ^ ^ ^ 31020 

tacaaccccgaaacccccacccctagaacgactcccgcgccggacacgctcgcgacaacc 

^^^^^'^^cccaacccaccagcccacccaaaatcccacggtaaaaccgccaaaagcaacac 
3102* ♦ — ♦ ^ ^ ^ ^ 31080 

caaacacggactggacagccgaacaggccccagagcgccaccccgacggcccccaccgca 

-i 1 AO 1 ^^^*=*^^^**9^tc*cctaaacggagacaaaactaaacctgcaacaccaaccaccacacc 

31081 ♦ ♦ ^ ^ ^ ^ 31140 

acagccagcccaaacgaatccgcccccgccccgacccggacactgcgactggtaacgcga 

MslI 

t 

111^1 ^^^^srgcacacaggaaacaggagacacaaccccaagcgcacaccccacgccacccccacg 
31141 ♦ ^ ^ ^ ^ 31200 

cccgccacgcgccccccgccccctgcgccgaggctcacgcatgagacacagcAaaagcac 
Kael 

MSCI 

Eael J vspl sspl 
I « II 
^gaccggcccggccacaaccacactaacgaaacacccgccacacccccccacacccctcc 
3.201 • — ^ ^ ^ 31260 

cccgaccagaccggcgccgacgcaatcaccccacaaacggcgcaggagaacgtgaaaaag 

Ss^DI Afllll Muni 
' I I 
, ^'^•"t.Wccaagaacaaagaaccgcttgcgccacgcctcaacgcgcccacctctcaac 
31261 — ♦ ♦ ♦ ♦ ♦ 31320 

cacgcaacgggcccccactcctcagcaaacacaacacaaagccgcacaaacaaaaagcca 

ApoZ 

I 

^9"^****'^tccaagccaccccccatccagcagcacagccccaccaccacacagcccac 

31321 * — — ♦ ♦ 1. ♦ 31390 

acgccccccaaagcccagcaaaaagcaagccaccacaccggggcggcggcgcaccgaaca 

BspMl 

acagaccaccgcacctcaaccaaactcacagaaccccagcacccaacccgccacctccct 
31381 » ^ ^ 31440 

cgcctagcggcacggaactagcccgagcgccccgggaccacaagccggacggcggaggga 
Bsbl Taci Hael 

^ , - . , *^*=^**"<=*<=*9a^cacacagccccctccccccggccggcctcaaaaagcaccacaccat 

- - 31500 



OSgccgcgcgccccacgcgccaggaaagaggggccgaccggaacctcccgcagcacagca 

31560 



cccactgcccgcacaagaacccacaacacaaggcgcgccaaaggacagcccggcccgcga 

Ann*""? Figure 28TT 

Vspl Avar I Smll I BspOX I 
> I i II II 
^*^^*9^9*^*tt*acaaaccccccgggcagctcacccaagcccacgccgccgcccagct 
31561 ♦ ^ ♦ ♦ 31^20 
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31680 



31740 



31800 



gcagccaccacaaccacccgaggggcccgccgagtgaacccaagcacagcgaco^yccsa 
BPU1102I 
AlwNII 

( I Mmel 

31621 -l^ll^A^A^^^^^ 

cgactcggcgtccgacgacVgVcVg^VcVcUVc;;;;;^^^^ 

I sflrf ! 

31681 All^Jll^'L^ll^^^^^^^ 

cgcggacgtacccccatcccagcatVa;;;;;VaVc;;Va;;;;^^^^^^ 

PSCI 3CSI 

I BsrBIl Sfcl I BseRii i 

31741 -l^-^A^.l^^^^^^^^ 

cgcgcgcccactcgacgacggcgVcVgVgVgVcwa^V^^^^^^ 

BbyC'*' • Kaell 

BpuiOl BsaHI I BSP1286I 

Bsal ! ^^^^ ' 

I Ban! I I Bsesi i 

3 180 1 Alllll^A^^^^^^^^^ 

agaggagccgctaccaagcgcggcgggcVcc^cVcrccgcggai^ 

SCcI 

AlwNIIPsCi Sspl 

31861 Al!?Jl\l^^^^^^ 

ccgcgcgggaccagagcVaVcVt;VcVg;VcVaVw^^^^^ 

RleAI 
Haelli BclVl 

3 192 1 

agccctagggcgccacgtcccgcVaVacrgVcVccgigUd^^^ 
Kael 

M3CI 

Cael I 
BsaAl I : 
Pmli I I BspMl 

I I r I BspGI 

31981 

gcaccggcagcatggcgccVgVgVcVaVcU;Vc;U^^^^ 

Kpnl 

Nspl Bani i 

3204 1 

cgcacctgcaatggagaaaaccgVaVaVcVcUaVtWCw^ 

32101 

agaecMcccgtaccgeggcaOTVTOVgVcaaVaVccwcilcewM 
Psci 

^'^f [ BSP1286I 

32X61 

gccgacacgcgacgtccccVgVcVcVgVcVc^^^^^^^^ 

BSCOSI 

NcoZ BsaAI _ . ^ nvwv 

sty^ Real Er«Rv . P"^^ Figure 2 8UU 

3222 1 A\l*Lll^J^^^^^^ 

ccggcacccagcagcacgagcagVaVcVcaVcUUlccVc^cw^ 



31860 



31920 



31980 



32040 



32100 



32160 



32220 



32280 
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BseRI 

Bsu36l t Drdll 
II ( 
cacacccccccaggactacaagccccccccgcGCiagaaccacaccccagggaacaaccc 

32281 * - -- • 32340 ' 

acgtga^ggagccccaacgcccgaggagggcgcaaccccggcacagggccccccgctggg 

PstI 

3tsl I BbsZ 

Sfcl 1 RleAI IBsaAl Oralll 
I i III I 

acccccgaaccagcgcaaaccccacaccgcagggaagaccccgcacgcaacccacgccg^ 

32341 ♦ • — 32400 

caaggacccagccgcacccagggcgcgacgccccccccggagcgcgcaccgagcgcaaca 

HspAII 
Bpml I 
I I 

gcactgccaaagcgccacacccgggcagcagcggatgaccciccagcacggtagcgcggg 

32401 ♦ ♦ — 32450 

cgcaacagccccacaacgcaagcccgccgccgcccaccaggaggccacaccaccgcgccc 

AccI 
I 

cccccgccccaaaaggaggcagacgacccccaccgcacggagcgcgccgagacaaccgag 

32461 ^ — * — ♦ 32520 

aaagacagagcccccccccacccgctagggacgacacgccccacgcggccccgccggccc 

PflllOSI 

Bsbl I Tthllll 

I I I 

accgcgccggccgcagcgccacgccaaacggaacgccggacgcagccacaccccccgaag 

32S21 — - ♦ ♦ 32330 

cagcacaaccagcaccacagcacggcccaccccgcggcccgcaccagcacaaaggacccc 

BsmBI 

Bglll Bsawi I 

SexAI ECOS7I BstYI Tthillll I Bsal 

1 I 1 III 

caaaaccaggcgcgggcgcgacaaacagacccgcgcccccggccccgccgcccagaccgc 

32531 ♦ * ♦ ♦-^ ♦ 32S40 

gccccggcccacgcccgcaccgtccgcccagacgcagaggecagagcggegaacccagcg 

Haell 
Xcral 
BsaKI i 
Narl ; 
Banll I 
I I I 

cccgtgtagcagccgcagcacacccaccccctcaaagcacccaggcgccccccggccccg 

32641 ♦ ♦ — . • ♦ ♦ 32700 

agacacaccaccaacatcacataggcgagagagccccgcaggcccgcgggggaccgaagc 

MspAII 

L 

ggccccacgcaaacccccccacgcgccgccgccccgacaacacccaccaccgcagaacaa 

32701 — ^.-t-r - * ♦ ♦ 32760 

ccaagacacacttgaggaagcacgcggcgacgggaccaccgcaggcggtggcgccccacc 

BsrBI 
Earn 
Sapli 

TaqZI Acelll I I 

I III 
gccacacccagccaacccacacacccgccccgcgagccacacacgggaggagcgggaaga 

32761 ^ ♦ 32320 

cggcgcgggccggccggacgcgcaagcaagacgctcagcgcgcgccccccccgcccttct 

Bglll 

BseRI Drdll BscYI 
! I t 

gccggaagaaccacgccccccctcccactccaaaagaccacccaaaaccccaaaacgaag 

32821 - ♦ ♦ 32380 

cgacccccccggcacaaaaaaaaaaacaaggccccccaacaggccccggagccccacccc 

Baawi sec I 

i I 
acccaccaagcgaacgcgcccccccccggcggcgcggccaaactccacagccaaagaaca 

32881 ♦ ♦ ♦ 32940 

cagacaacccacccgcgcgaggggaggccaccgcaccagcccgagacgccggccccccgc 

gacaacggca cc cgcaagacgc cgcacaacggc c cccaaaaggcaaacggcccccacgc c 

32941 ♦ * ♦ 33000 

ccaccaccgcaaacaccccacaacgcgccaccgaaggcccceegcccgccgggagcgcag 

ECOS7I Hgilff? Figure 28W 

I II 
caagcggacgcaaaggccaaaceccccagggcgaaccccececacaaacaccccageace 
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33001 * ^ 

gcccacccgcacccccgactcggg^VgVcVcrcrc;;;;;;^^^^^ 

33 061 a ca c c c cc aagcaaac c 

aagccggcacgggctcaccUgagcVglVcg^cVgragUVciUU^^ 
Bpml 

Ssar rnt?? ! 3ce83I 

ggctcacaacccaggccgVcVaVaVcVcVaVaVg^w^^^^ 

Tchlllii 
Real I Apol 
' I ! 
33181 ^^Jl^l"^^^^^^^^ 

gctcgccgctcagcaccaaVgVcVcVaaVcVcUw^^^^ ^^^^^ 

EcoOl09r MspAir Bsgl 

PspSlI Pvull BspMI 

33241 A^Jlll^^"?^^^^ 

ccgccctgtaaccgcccctacggegceagggcacccagg§sliiiw^ 

Cael 

M3ir 

Bsgl I 

33 301 A^^^^ 

aCLagcacgcccagacgtgcccggcVgVgVcVgVglaggg^^^ 

RleAI 

I 

33361 ^^^f^f^f^^^^^^^^}}^^^^^^^9C^^^<^^ cggagcca tgccaaccagcgcagccccgatg 

cccgggcgcgaccaacaccgcgcgtacgagccUgiUcgUtggtcgcacc^^ ^^^^^ 
HlndllZ 

33421 

a ctcgaaacaacgcacccgccgccaUVtVc7cVc7ccacgicg^^^ ^ ^ ^ ® ° 

PC11108I Mall BspMI 

33481 A^A^JL^fJL'^^^^^ 

ccggagcgcgctctcccctccgcgcagcaccagVacglVtacgccUcrcc^ 

Bsawi ^^^^ 

°'^<=^" Bs^laiii Tthiuir 

II I 

33541 

* ^ 33600 



gaggccccggcggcgcctccctccgcggcaaaaagagagUcgUcIgacg^^ 



Oral 

33601 AVL\*A^A^A\*JL^^^^^ 

gcacccgcgtcccacctcaccVcVtVcVcVcallcVcgcU^^ ^"^^^ 

BglZ 

eael BsrPZl 
Gdill NgoAIVI 

33661 -^A*A*A^^AA^™ 

caaaaa 

ccctcccgccgggaacactcgcaccVtVcVcVaVgccigwc^^^ ^^"'^^ 

Bsawi 

RorPTT Taqll BspEI 

BseRI lHln4I Acelll f 

3 372 1 

t tgaccagcggcact aac t cc ccgtggcggccgVcglgglgccagc^ " ^ ® ° 

Taqii B31EI Flgurs 28WW 

33781 Al'A\\^AVA'5JL^^^^^^ 

cacactccgagccacctgcgcagcccaaccaagtagcMgUacgacccUcgcc^ 
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Taqll 
SmaX I 
Aval I I 
i I I 

acagcccgggggaacacacacccgcaggc9cagagacaacaccacaacccccaca99a9g 

33841 ♦ - — ♦ ♦ ♦ ♦ 33900 

caccgggcccccccacgcacgggegcccgcacccccgccgcaacgccgggggcacccccc 

Avrll 

vspl Styl 
I I 
cacaacaaaaccaacaggagagaaaaacacacaaacacccgaaaaacccccccgcccagg 

33901 ♦ ♦ ♦ ^ 33960 

acaccgccctaaccacccccccctcccgcgcacccgcggaccccccgggaggaeggaccc 

Haell 

Bpml BsrBI Eco47Xir | HspAlI 

I I III 

caaaacagcaccctcccgccccagaacaacacacagcgccccacagcggcagcccaacag 

33961 ♦ ♦ ♦ ♦ 34020 

gccccaccgcgggagggcgaggccccgccgcacgccgcgaagcgccgccgccggaccgcc 

Bani 

I 

ccagccccaccagcaaaaaagaaaacccaccaaaaaaacaccacccgacacggcaccagc 

34021 »-* *- 4—. ♦ 34080 

agccggaacggccacccccccccccggacaacccccccgcggcgagecgcgccgcggccg 

ActtXZI Bsgl 
I I 
ccaaccagccacagcgcaaaaaagggccaagcgcagagcgagcacacacaggaccaaaaa 

34081 * ♦ 4. * 34140 

agccagccagcgccacacccctccccggcccacgcccegcccacacacaccccgaccccc 

Taqll 
I 

acgacgcaacggccaaagcccacaaaaaaeaceeagaaaaecgcacgcgaacccacgccc 

34141 ♦ ♦ — ♦ ♦ 34200 

caccgcaccgccaaccceaggcgccccccgcgggccccccggcgcgcgcccggacgcggg 

RleAI 

I 

agaaacgaaagccaaaaaacccacaacccccccaaaccgccacccccgcccccccacgcc 

34201 ♦ > 1. ♦ 34260 

cctccgccctcggccccccgggcgctgaaggagcccagcagcgaaggcaaaagggcgcaa. 

».BsaAI V 
SnaBI Bsbl EclX 

I i I 

acgcaaccccccatcccaagaaaaccacaacccccaacacacacaagccaccccgcccca 

34261 ♦ • - ♦ ♦ ♦ 34320 

. cgcaccgaagggcaaaacceccccgacgccaagggccgcgcacgcccaacgaggcgggac 

aaacccacgtcacccgcccegttcccacgecccgegccaegccacaaaecccaccceecc 

34321 ♦ ♦ • ♦ ♦ ♦ 34380 

cccggacgcagtgggcggggcaagggcgcggggcgcggcgeagcgcccgaggcgggggag 

accaccatactggcctcaacccaaaacaaggtacaccaccgacgacg 

34381 ♦ ♦ ♦ ♦ 34427 

caacagcacaaccgaagccaggccccaccccacacaacaaccaccac 

Enzymes chac do cuc: 



Aarl 


Aacll 


ACCI 


AceXXX 


AclX 


AflXX 


ACIXXX 


AhdX 


Alol 


AlwNI 


ApaX 


ApaLX 


ApoX 


ASCX 


Aval 


AvrXX 


Bael 


BanvHI 


Ban I 


Ban XX 


Bb3X 


BbvCI 


Bce83X 


Bcgx 


BclVI 


Bell 


BglX 


BglXX 


BmgX 


Bmrx 


BplX 


BpmX 


BpulOl 


Bpull02X 


Bsal 


BsaAX 


BsaaX 


BsaHX 


BsaWX 


asaxx 


Bsbl 


BseRI 


BseSI 


BsgX 


BSIEX 


BslHKAX 


BsmX 


Bsmax 


Bsp24I 


Bspl286I 


B3PEX 


BspGX 


BspLUllX 


B3PMX 


B3rBX 


BsrDX 


BsrFI 


BsrGI 


BssKII 


B33SX 


BSCAPX 


BSCDSX 


BSCEXX 


BSCXX 


BsCYI 


BstZlTl 


B3U36X 


BCSX 


ClaX 


Drax 


. OraXXX 


DrdX 


DrdXI 


Eael 


EagX 


EarX 


EclX 


EC047XXX 


ECOS7X 


ECONX 


ECOO109X 


EcoRX 


EeoRV 


Fsel 


P3pX 


GdilX 


HaeX 


HaaXX 


Haeiv 


HglEXX 


Hln4X 


HincXI 


HlndXXX 


HpaX 


Kpnl 


MluX 


Mmel 


HscX 


Mali 


HspAlX 


MunX 


NarX 


NCOX 


NdaX 


NgoAXV 


Nhel 


NOCX 


NniX 


NslX 


NspX 


PacX 


PflllOBX 


P£1MI 


PlliAX 


PneX 


PmlX 


PshAX 


PspSXX 


PSCX 


PvuX 


Pvuil 


Real 


RlaAX 


RarXX 


SacX 


Sac XX 


SalX 


SanDX 


Sap I 


SbfX 


scaX 


SaxAI 


StcX 


SCIX 


Sg£X 


SgrAX 


• Smal 


Smll 


snaBX 


SpaX 


sphX 


SrfX 


Sse8647X 


S3PX 


. SCUl 


scyl 


SunX 


swax 


TaqXX 


TacX 


TchlllX 


TthlllXX 


UbaLX 


vspx 


Xbax 


XemX 


XhoX 


XmnX 







Enzymes chac do noc euc: 
Napv 



Figure 28XX 
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